PEROXISOME-ASSOCIATED POLYPEPTIDE, NUCLEOTIDE SEQUENCE
ENCODING SAID POLYPEPTIDE AND THEIR USES IN THE DIAGNOSIS
AND/OR THE TREATMENT OF LUNG INJURIES AND DISEASES, AND OF
OXIDATIVE STRESS-RELATED DISORDERS

Field of the invention 

The present invention is related to a new
peroxisome-associated polypeptide, the nucleotide sequence
encoding said polypeptide and portions thereof, as well as
their uses in the diagnosis of several diseases, especially
the diagnosis and/or the treatment of lung injuries and
diseases, and of oxidative stress-related disorders.

Background of the invention 

The peroxisomes are organelles nearly
ubiquitous in eukaryotic cells. They contain enzymes
essential for various catabolic and anabolic pathways. Some
of these enzymes are expressed constitutively while others
can be induced under appropriate conditions. Peroxisomes
carry out a variety of essential reactions such as
peroxisomal oxidation and respiration, fatty acid
beta-oxidation, cholesterol and dolichol metabolism,
ether-phospholipid synthesis, and glyoxylate and pipecolic
acid metabolism.

The peroxisomal respiratory pathway is based
upon the formation of hydrogen peroxide by a collection of
oxidases and the decomposition of the H2O2 by catalase.
These reactions are responsible for 20% of oxygen
consumption in liver, and several oxidases have been
identified in peroxisomes. Ethanol elimination via catalase
in peroxisomes may be significant in addition to the
oxidation via cytosolic alcohol dehydrogenase.

The peroxisomal beta-oxidation system
catalyses the beta-oxidative chain shortening of a specific
set of compounds which cannot be handled by mitochondria:
very long chain fatty acids, di- and trihydroxycholestanoic
acids, pristanic acid, long chain dicarboxylic acids,
several prostaglandins, several leukotrienes, 12- and
15-hydroxyeicosatetraenoic acid, and several mono- and
polyunsaturated fatty acids, which are of direct diagnostic
relevance for some peroxisomal disorders.

Peroxisomes also play a major role in the
synthesis of cholesterol and other isoprenoids. Fibroblasts
from patients affected by disorders of peroxisome
biogenesis show a low capacity to synthesise cholesterol.

Two enzyme activities responsible for the
introduction of the characteristic ether linkage in
ether-linked phospholipids (dihydroxyacetonephosphate
acyltransferase (DHAPAT) and alkyldihydroxyacetonephosphate
synthase (alkyl-DHAP synthase)) are localised in
peroxisomes. These enzymes are not yet cloned. As
demonstrated by the identification of patients with a
deficiency of either DHAPAT or alkyl-DHAP synthase and with
severe clinical abnormalities, ether-phospholipids are of
major importance in humans.

Peroxisomes are able to detoxify glyoxylate
via alanine/glyoxylate aminotransferase. The deficiency of
this cloned enzyme causes hyperoxaluria type I.
L-pipecolate is a minor metabolite of L-lysine and is
catabolised by the L-pipecolate oxidase localised in
peroxisomes. The enzyme is deficient in cerebro-hepato-renal
(Zellweger) syndrome.

In humans, the importance of peroxisomes was
emphasised by a number of inherited diseases involving
either a defect in the biogenesis of peroxisomes or a
deficiency of one (or more) peroxisomal enzymes. So far, 12
different peroxisomal disorders have been described and
most of them are lethal.

A wide variety of chemicals have been shown
to produce peroxisome proliferation and induction of
peroxisomal and microsomal fatty acid-oxidising enzyme
activities in rats and mice. Several peroxisome
proliferators have been shown to increase the incidence of
liver tumours in these species. Proposed mechanisms of
liver tumour formation by peroxisome proliferators include
induction of sustained oxidative stress.

Therefore, newly identified molecules
associated with peroxisomes could be used for the
development of diagnostic tools and possibly for the
improvement of several therapeutic applications for
various diseases associated with peroxisomal disorders. In
addition, it is useful to identify the molecules which are
present in specific organs like the lung and which may be
used as specific markers of inflammatory diseases as well
as lung injuries or diseases.

Summary of the invention

The Inventors have isolated and purified a
new sequence of a low molecular weight human
broncho-alveolar polypeptide. Said mammal, preferably human,
protein or polypeptide (hereafter identified as B18hum
protein) has been sequenced and its corresponding genomic
DNA (SEQ ID NO 8) and cDNA (SEQ ID NO 1) have been
identified. Similarly, the corresponding nucleotide and
amino acid sequences from a rat (SEQ ID NO 3 and 4) and from
a mouse (SEQ ID NO 5 and 6) have been obtained.

Said sequences present several homologies
with other peroxisomal proteins of yeast and comprise a
carboxy-terminal tripeptide SQL which is necessary for the
specific targeting and translocation of several proteins
into the peroxisome.
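The carboxy-terminal SQL criterion mentioned above can be checked mechanically. The following is a minimal sketch, assuming the protein is available as a one-letter amino acid string; the function name and the toy sequences are hypothetical and are not taken from SEQ ID NO 2.

```python
def has_c_terminal_tripeptide(protein: str, tripeptide: str = "SQL") -> bool:
    """Return True if the one-letter protein sequence ends with the tripeptide."""
    cleaned = protein.strip().upper()
    return len(cleaned) >= len(tripeptide) and cleaned.endswith(tripeptide.upper())


# Hypothetical toy sequences (NOT SEQ ID NO 2), used only to exercise the check.
toy_with_signal = "MAPIKVGDAISQL"
toy_without_signal = "MAPIKVGDAIAAA"

print(has_c_terminal_tripeptide(toy_with_signal))
print(has_c_terminal_tripeptide(toy_without_signal))
```

Such a check only tests for the presence of the tripeptide at the carboxy terminus; it says nothing about whether a given protein is actually imported into the peroxisome.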

Therefore, the present invention is related
to a new isolated and purified polypeptide sequence having
an amino acid sequence which presents more than 70%
homology, advantageously more than 85% homology, more
preferably more than 95% homology, with the amino acid
sequence SEQ ID NO 2. Said amino acid sequence is
advantageously obtained from a mammal, preferably from a
rat, a mouse or a human.
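The 70%, 85% and 95% thresholds can be illustrated with a small calculation. This is a sketch under stated assumptions: the text does not define how "homology" is computed, so the example uses simple percent identity over a pre-aligned pair of sequences as a stand-in, and the function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over two pre-aligned sequences ('-' marks a gap).

    Assumption: columns where both sequences carry a gap are ignored;
    a residue facing a gap counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def homology_tier(percent: float) -> str:
    """Map a percentage onto the three thresholds named in the text."""
    if percent > 95:
        return "more than 95%"
    if percent > 85:
        return "more than 85%"
    if percent > 70:
        return "more than 70%"
    return "70% or less"


# Hypothetical toy alignment, not taken from any SEQ ID.
value = percent_identity("ACDEFGHIKL", "ACDEFGHIKV")
print(value, homology_tier(value))
```

The thresholds are strict ("more than"), so a value of exactly 70% falls outside the broadest tier in this sketch.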

The present invention is also related to the
isolated and purified polypeptide sequence corresponding to
the amino acid sequence SEQ ID NO 2 or a portion thereof,
preferably an immunoreactive portion (putative immunogenic
domain or T or B cell epitopes).

Said portions are advantageously comprised
between:

- Glutamic acid position 13 - Glutamic acid position 27
- Alanine position 26 - Leucine position 36
- Alanine position 42 - Glutamic acid position 57
- Glutamic acid position 57 - Valine position 69
- Valine position 80 - Leucine position 97
- Arginine position 95 - Leucine position 112
- Serine position 118 - Serine position 129
- Valine position 137 - Threonine position 150
Preferably, said portion has more than 10,
20, 30, 50 or 70 amino acids. Specific portions of the
amino acid sequence SEQ ID NO 2 are also portions of more
than 70 amino acids which present at least 80% of the
proteinic activity (see example 5) of the complete SEQ ID
NO 2 sequence. Therefore, the amino acid sequence according
to the invention can be partially deleted while maintaining
its activity, preferably its anti-oxidative activity, which
will be described hereafter.

According to the invention, the amino acid
sequence SEQ ID NO 2 presents a pI of 7.16 and a molecular
weight of 17047 Dalton as hereafter defined by
bidimensional electrophoresis.
The present invention is also related to the
nucleotide sequence encoding the amino acid sequence
according to the invention and its regulatory sequences
upstream of said coding sequence. A nucleotide sequence
encoding the polypeptide according to the invention is a
genomic DNA (see SEQ ID NO 10), a cDNA (see SEQ ID NO 1) or
a mRNA, possibly comprising said upstream regulatory
sequence. Advantageously, said nucleotide sequence presents
more than 70%, advantageously more than 85%, more
preferably more than 95% homology with SEQ ID NO 1 or its
complementary strand.

According to a preferred embodiment of the
present invention, said nucleotide sequence corresponds to
the nucleotide sequence SEQ ID NO 1, its complementary
strand or a portion thereof.

"A portion of the nucleotide sequence SEQ ID
NO 1" means any nucleotide sequence of more than 15 base
pairs (such as a primer, a probe or an antisense nucleotide
sequence) which allows the specific identification,
reconstitution or blocking of the complete nucleotide
sequence SEQ ID NO 1 (including its regulatory sequences
upstream of the coding sequence).

Said portions allow the specific
identification, reconstitution or blocking by specific
hybridisation with the nucleotide sequence SEQ ID NO 1,
preferably under standard stringent conditions, with
sequences like probes or primers possibly labelled with a
compound (radioactive compound, enzyme, fluorescent marker,
etc.), and can be used in a specific diagnostic or dosage
method like probe hybridisation (see Sambrook et al., §§
9.47-9.51 in Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor Laboratory Press, Cold Spring Harbor, New
York (1989)) or genetic amplification (like PCR (US patent
4,683,195), LCR (Wu et al., Genomics 4, pp. 560-569), CPR
(US patent 5,011,769)).

Exemplary stringent hybridisation conditions
are as follows: hybridisation at 42 °C in 50% formamide, 5x
SSC, 20 mM sodium phosphate, pH 6.8; washing in 0.2x SSC at
55 °C. It is understood by those skilled in the art that
variations of these conditions occur based on the length and
GC nucleotide content of the sequence to be hybridised.
Formulas standard in the art are appropriate for



WO 99/09054 



PCT/BE98/00124 



determining exact hybridisation conditions (see Sambrook et
al.).
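To illustrate how length, GC content, salt and formamide enter such formulas, the sketch below uses one widely quoted empirical approximation for long DNA-DNA hybrids (often attributed to Meinkoth and Wahl): Tm = 81.5 + 16.6 log10[Na+] + 0.41 (%GC) - 0.61 (% formamide) - 500/L. This formula is an assumption brought in for illustration; the text does not say which formula is intended. The sodium content of 1x SSC (about 0.195 M) and the 50% GC value are likewise assumptions.

```python
import math

# Assumption: 1x SSC is 0.15 M NaCl + 0.015 M trisodium citrate,
# i.e. roughly 0.195 M sodium ions.
SODIUM_MOLAR_PER_1X_SSC = 0.195


def ssc_to_sodium_molar(ssc_fold: float) -> float:
    """Approximate sodium molarity of an SSC dilution (e.g. 5 for 5x SSC)."""
    return ssc_fold * SODIUM_MOLAR_PER_1X_SSC


def estimate_tm_celsius(length_nt: int, gc_percent: float,
                        sodium_molar: float,
                        formamide_percent: float = 0.0) -> float:
    """Empirical melting temperature estimate for a long DNA-DNA hybrid."""
    if length_nt <= 0 or sodium_molar <= 0:
        raise ValueError("length and sodium concentration must be positive")
    return (81.5
            + 16.6 * math.log10(sodium_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length_nt)


# Hypothetical probe: 554 nt long with an assumed 50% GC content.
tm_hybridisation = estimate_tm_celsius(554, 50.0, ssc_to_sodium_molar(5), 50.0)
tm_wash = estimate_tm_celsius(554, 50.0, ssc_to_sodium_molar(0.2))
print(f"Estimated Tm in 5x SSC, 50% formamide: {tm_hybridisation:.1f} C")
print(f"Estimated Tm in 0.2x SSC: {tm_wash:.1f} C")
```

The estimate is only indicative: it was derived for long hybrids and does not apply well to short oligonucleotide primers.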

Preferred examples of said nucleotide
portions are as follows:

Sequence                                         Position
5'-gccatcccagcagtggaggtgtttg-3' (SEQ ID NO 11)   217-241
5'-ttgaacagctctgccaggttcacc-3' (SEQ ID NO 12)    261-284
5'-tggaggtgtttgaaggggagccag-3' (SEQ ID NO 13)    230-253
5'-caggttcaccttgttccctggctc-3' (SEQ ID NO 14)    247-270
5'-gggtatgggactagctggcg-3' (SEQ ID NO 15)        33-52
5'-ctggccaacattccaattgcag-3' (SEQ ID NO 16)      747-768

and the sequences of respectively 601 (SEQ ID NO 8), 604
(SEQ ID NO 9) and 469 (SEQ ID NO 7) base pairs
corresponding to specific mRNA alternative splicing of the
B18 human nucleotide sequence as described in Figure 4 (the
known genomic sequence incorporating several introns and
exons is represented in the sequence SEQ ID NO 10).
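The oligonucleotides and positions listed above can be cross-checked with a few lines of code. The sketch below is illustrative only: the helper names are hypothetical, and SEQ ID NO 15 is written as it appears in the RT-PCR primer pair of Example 6. It verifies that each oligonucleotide length matches its position span and computes reverse complements and GC content.

```python
OLIGOS = {
    "SEQ ID NO 11": ("gccatcccagcagtggaggtgtttg", 217, 241),
    "SEQ ID NO 12": ("ttgaacagctctgccaggttcacc", 261, 284),
    "SEQ ID NO 13": ("tggaggtgtttgaaggggagccag", 230, 253),
    "SEQ ID NO 14": ("caggttcaccttgttccctggctc", 247, 270),
    "SEQ ID NO 15": ("gggtatgggactagctggcg", 33, 52),
    "SEQ ID NO 16": ("ctggccaacattccaattgcag", 747, 768),
}

_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.lower().translate(_COMPLEMENT)[::-1]


def gc_percent(seq: str) -> float:
    """Percentage of G and C nucleotides in the sequence."""
    lowered = seq.lower()
    return 100.0 * (lowered.count("g") + lowered.count("c")) / len(lowered)


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based position range."""
    return end - start + 1


for name, (seq, start, end) in OLIGOS.items():
    consistent = len(seq) == span_length(start, end)
    print(f"{name}: {len(seq)} nt, positions {start}-{end}, "
          f"length matches span: {consistent}, GC {gc_percent(seq):.0f}%")
```

In this check every oligonucleotide length agrees with its stated span. The reverse complement of SEQ ID NO 14 begins with the last seven nucleotides of SEQ ID NO 13, which is consistent with SEQ ID NO 14 being written in antisense orientation over the overlapping positions 247-253.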

Said sequences may be used for a genetic 
amplification or a probe hybridisation as above-described. 

The present invention is also related to a
vector comprising the necessary elements for the injection,
transfection or transduction of cells and having
incorporated one or more of the nucleotide sequences
according to the invention. The vector according to the
invention is selected from the group consisting of viruses,
plasmids, phagemids, cationic vesicles, liposomes or a
mixture thereof. Said vector may also comprise one or more
adjacent regulatory sequences (such as promoter(s),
secretion and termination signal sequence(s)),
advantageously operably linked to the nucleotide sequence
according to the invention.

The present invention is also related to the
cell transformed by said vector and expressing the
polypeptide according to the invention.

The nucleotide sequence according to the
invention can also be introduced in said cell by the
formation of a CaPO4-nucleic acid precipitate, a
DEAE-dextran-nucleic acid complex or by electroporation.

Another aspect of the present invention is
related to an inhibitor of the polypeptide according to the
invention or the nucleotide sequence according to the
invention (including the upstream sequences like a
promoter-operator regulatory sequence which may be
inhibited by a cis- and/or transactivating repressor). Said
inhibitor is advantageously an antibody or a fragment of
said antibody such as a hypervariable portion of said
antibody directed against the amino acid or nucleotide
sequence of the polypeptide according to the invention.
Other examples of inhibitors according to the invention are
antisense nucleotide sequences which allow the blocking of
the expression of the nucleotide sequence according to the
invention.

Another aspect of the present invention is
related to a diagnostic device (such as a diagnostic kit or
a chromatographic column) comprising an element selected
from the group consisting of the amino acid sequence of
said polypeptide, its nucleotide sequence, and/or the
inhibitor according to the invention or a fragment thereof
as above-described. Said diagnostic device may also
comprise the necessary reactants and media for the
diagnostic and/or dosage of the nucleotide and/or amino
acid sequence of the polypeptide according to the
invention, which are based upon the method selected from
the group consisting of in situ hybridisation,
hybridisation by labelled antibodies, especially RIA (Radio
Immuno Assay) or ELISA (Enzyme-Linked Immunosorbent Assay)
technologies, detection upon filter, upon solid support, in
solution, in sandwich, upon gel, dot blot hybridisation,
Northern blot hybridisation, Southern blot hybridisation,
isotopic or non-isotopic labelling (by immunofluorescence
or biotinylated probes), genetic amplification (especially
by PCR or LCR), double immunodiffusion technique,
counter-electrophoresis technique, haemagglutination or a
mixture thereof.

Another aspect of the present invention
concerns a diagnosis method wherein a biological sample
from the patient, such as cephalo-rachidian fluid, serum,
blood, plasma, urine, broncho-alveolar lavage, stomach
lavage, etc., is isolated from the patient, and is put in
contact with the diagnostic device according to the
invention for the diagnosis or the monitoring of an injury
or a disease, preferably a lung injury or an oxidative
stress-related disorder, affected by the presence of
pro-oxidant agents or oxidative stress, such as specific
cardio-vascular diseases like arteriosclerosis,
neurodegenerative disorders (Alzheimer's disease,
Parkinson's disease, amyotrophic lateral sclerosis),
apoptosis, inflammatory reactions, allergic reactions such
as asthma, hay fever and eczema, high bone mass syndrome,
osteopetrosis, osteoporosis-pseudoglioma syndrome, and
Bardet-Biedl syndrome 1. Said diagnosis and monitoring upon
one or more biological samples obtained from several
tissues from the patient can be advantageously obtained by
one or more of the methods above-described, which could be
adapted
according to the specific biological sample by the person 
skilled in the art. 

Therefore, the product according to the
invention could be used as a marker for the
above-identified injuries, diseases or disorders in a broad
spectrum of tissues as shown in the enclosed Figure 1.

A further aspect of the present invention is
related to a pharmaceutical composition comprising a
pharmaceutically acceptable carrier and an element selected
from the group consisting of the nucleotide sequence, the
amino acid sequence of the polypeptide according to the
invention, the inhibitor directed against said sequences
and/or one or more portions thereof.

A last aspect of the present invention is
related to the use of the pharmaceutical composition
according to the invention for the manufacture of a
medicament in the treatment and/or the prevention of lung
injuries and/or diseases or of oxidative stress-related
disorders.

The present invention is also related to a
prevention and/or treatment method of a patient, especially
a human patient, preferably affected by lung injuries
and/or diseases or by oxidative stress-related disorders,
wherein a sufficient amount of the pharmaceutical
composition according to the invention is administered to
said patient in order to treat, avoid and/or reduce the
symptoms of said injuries and/or diseases.

Other injuries and/or diseases which can be
prevented and/or treated are injuries and/or diseases
affected by the presence of pro-oxidant agents or oxidative
stress, such as specific cardio-vascular diseases like
arteriosclerosis, neurodegenerative disorders such as
Alzheimer's disease, Parkinson's disease, amyotrophic
lateral sclerosis, apoptosis and inflammatory reactions and
some allergic reactions such as asthma, hay fever and
eczema, high bone mass syndrome, osteopetrosis,
osteoporosis-pseudoglioma syndrome, and Bardet-Biedl
syndrome 1.

The pharmaceutically acceptable carrier
according to the invention is any compatible non-toxic
substance suitable for administering the composition
according to the invention to a human patient.
Pharmaceutically acceptable carriers according to the
invention suitable for oral administration are the ones
well known by the person skilled in the art, such as
tablets, coated or non-coated pills, capsules, spray-gas,
patches, gels, solutions or syrups. Pharmaceutically
acceptable carriers vary according to the mode of
administration (intravenous, intramuscular, subcutaneous,
parenteral, etc.), and may also comprise adjuvants well
known by the person skilled in the art to increase, reduce
and/or regulate the humoral, local and/or cellular response
of the immune system.

The pharmaceutical composition according to
the invention may be prepared by the methods generally
applied by the person skilled in the art in the preparation
of various pharmaceutical compositions, wherein the
percentage of the active compound/pharmaceutically
acceptable carrier can vary within very large ranges, only
limited by the tolerance of the patient to said
pharmaceutical composition, and wherein the limits are
particularly determined by the frequency of administration
and the possible side-effects of the active compound or
its pharmaceutically acceptable carrier.

Another aspect of the invention is related to
the use of the diagnostic device according to the invention
for performing upon the patient or upon a biological fluid
obtained from the patient, a diagnosis, a dosage and/or a
monitoring of the above-mentioned injuries or diseases or
oxidative stress-related disorders affecting the patient.

A further aspect of the present invention is
related to a cell or a non-human animal, preferably a
mammal such as a mouse or a rat, transformed by the vector
according to the invention and overexpressing the
polypeptide according to the invention, or a non-human
animal, preferably a mammal such as a mouse or a rat,
genetically modified by a partial or total deletion of its
genomic sequence encoding the polypeptide according to the
invention (knock-out non-human mammal) and obtained by
methods well known by the person skilled in the art such as
the one described by Kahn et al. (Cell, Vol. 92, pp.
593-596 (March 1998)).

Other examples of genetically modified
non-human animals according to the invention may be a
transgenic non-human animal comprising an inhibitor
according to the invention, preferably an antisense nucleic
acid sequence complementary to the nucleotide sequence
according to the invention so placed as to be transcribed
into antisense mRNA which is complementary to the
nucleotide sequence according to the invention and which
hybridises to said nucleotide sequence, thereby reducing or
blocking its translation.

Further aspects of the present invention will
be described in the enclosed non-limiting examples in
reference to the following Figures.

Brief description of the drawings

Figure 1 represents a dot blot analysis of mRNA
         encoding the polypeptide according to the
         invention in various types of human tissues.
Figure 2 represents a Northern blot analysis of mRNA
         encoding the polypeptide according to the
         invention in a rat lung after administration
         of lipopolysaccharides (LPS) inducing an
         inflammatory reaction of the lung.
Figure 3 represents a Northern blot analysis of mRNA
         encoding the polypeptide according to the
         invention in a rat lung after intraperitoneal
         injection of pneumotoxicants.
Figure 4 is a schematic representation of the human
         genomic sequence, the complete cDNA sequence
         and the corresponding amino acid sequence.
Figure 5 represents the alignment of the
         sequences of the human B18 polypeptide
         according to the invention with the
         corresponding rat and mouse sequences.



Example 1: Homology between the B18 polypeptide
according to the invention and other known
nucleotide or amino acid sequences

The BLAST 2.0 software (gapped BLAST at the
NCBI Internet site) was used for searching for homologies
between human B18 (162 amino acids) and known polypeptides
in databases (GenBank, SwissProt). Said search did not give
a perfect alignment with known peptides from different
species (Table 1). Homologies of the human B18 cDNA (805
nucleotides) with GenBank, EMBL, DDBJ and PDB deposited
nucleotide sequences (Table 2) and Expressed
Sequence Tags (ESTs) were noted.

Table 1: Homologies of the B18 protein (amino acid) with other proteins

Name                                                                 NCBI ID    Identity (%)    Homology (%)
Membrane protein (Synechocystis sp.)                                 1652859    57/129 (44%)    81/129 (62%)
Peroxisomal-like protein (Aspergillus fumigatus)                     2769700    56/176 (31%)    90/176 (50%)
Haein HI0572 hypothetical protein (Haemophilus influenzae)           1723174    53/146 (36%)    80/146 (54%)
PMP20 (Schizosaccharomyces pombe)                                    AJ002536   54/161 (33%)    85/161 (52%)
Peroxisomal membrane protein A (PMP 20) (Candida boidinii)           130360     59/170 (34%)    89/170 (51%)
Peroxisomal membrane protein B (PMP 20) (Candida boidinii)           130361     58/170 (34%)    88/170 (51%)
Putative peroxisomal protein PMP from yeast (Saccharomyces cerevisiae)  1709682  41/138 (29%)   72/138 (51%)
Alkyl hydroperoxide reductase C22 protein (Escherichia coli)         P26427     36/126 (28%)    58/126 (45%)



Table 2

Name                                                          Access NO   Identity
Human mRNA down-regulated in cells infected by adenovirus 5   U82616      259/263 (98%)
Human mRNA down-regulated in cells infected by adenovirus 5   U82615      300/321 (93%)

In Table 2, an identity of 98% has been
obtained with the alignment of 259 nucleotides of the B18
cDNA, which comprises in its totality 805 nucleotides, with
263 nucleotides of the U82616 cDNA. A similar identity has
been obtained with the U82615 sequence.
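The identity percentages of Table 2 follow directly from the match counts. The sketch below is a minimal illustration; the document does not state how the percentages were rounded, so truncation to a whole number is an assumption that happens to reproduce the two reported values, and the function names are hypothetical.

```python
B18_CDNA_LENGTH = 805  # total length of the B18 cDNA stated in the text


def truncated_percent(matches: int, aligned: int) -> int:
    """Whole-number percentage, truncated (assumed rounding rule)."""
    if aligned <= 0:
        raise ValueError("aligned length must be positive")
    return (100 * matches) // aligned


def aligned_fraction_percent(aligned: int, total: int) -> float:
    """Share of the full-length sequence covered by the alignment."""
    return round(100.0 * aligned / total, 1)


TABLE_2 = {"U82616": (259, 263), "U82615": (300, 321)}

for accession, (matches, aligned) in TABLE_2.items():
    print(f"{accession}: {matches}/{aligned} = "
          f"{truncated_percent(matches, aligned)}% identity, alignment spans "
          f"{aligned_fraction_percent(aligned, B18_CDNA_LENGTH)}% of 805 nt")
```

The second figure makes explicit that each alignment covers only part of the 805-nucleotide cDNA, so the identity values describe the aligned stretch rather than the whole sequence.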

The sequence SEQ ID NO 1 comprising 805
nucleotides presents a homology with several EST sequences
obtained from a human and from a mouse, having the
following references:
Human:
AA130751, N42215, W38597, N91311, N68467, AA187737,
N68916, W00593, R88950, AA181884, H20154, K6G66G
Mouse:
AA220019, AA123351, AA087129, AA255021, AA249897, W71344
Example 2: Tissue detection

A human RNA Master Blot (Clontech) containing
100-500 ng of poly-A+ human RNA in each dot (normalised to
the mRNA expression levels of eight different housekeeping
genes) was hybridised with a 554 bp-long B18 probe labelled
with 32P, and quantified using Phosphorimaging Technology.
As shown in Figure 1, B18 mRNA is present in all tissues
examined but predominantly in trachea, lung, kidney,
thyroid gland, stomach, colon, heart and some regions of
the brain. The highest expression has been noted in the
thyroid tissue. This presence is probably correlated with
the possible antioxidant activity of the B18 polypeptide
according to the invention.

Example 3: Inflammatory reaction

Figure 2 represents a Northern blot analysis
of rat lung mRNA 6, 48 and 72 hours after
lipopolysaccharide (LPS) instillation inducing an
inflammatory reaction in the lung.

A Northern blot containing 15 µg of total RNA
in each lane was hybridised with a 225 bp-long rat B18
probe, stripped and reprobed with a 572 bp-long rat β-actin
probe, both labelled with 32P. The Northern blot was
quantified using Phosphorimaging Technology and the B18
mRNA data were normalised to the β-actin mRNA level.
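The normalisation step described above amounts to dividing each lane's B18 signal by the β-actin signal of the same lane. The following sketch shows that arithmetic; the function names and all signal values are hypothetical placeholders, not measurements from Figure 2.

```python
def normalise_to_reference(target_signal: dict, reference_signal: dict) -> dict:
    """Divide each lane's target signal by the reference signal of the same lane."""
    ratios = {}
    for lane, target in target_signal.items():
        reference = reference_signal[lane]
        if reference <= 0:
            raise ValueError(f"reference signal for lane {lane!r} must be positive")
        ratios[lane] = target / reference
    return ratios


def fold_change_vs_control(ratios: dict, control_lane: str) -> dict:
    """Express each normalised ratio relative to the control lane."""
    control = ratios[control_lane]
    return {lane: value / control for lane, value in ratios.items()}


# Hypothetical phosphorimager counts (arbitrary units), for illustration only.
b18_counts = {"control": 1000.0, "6 h": 1500.0, "48 h": 800.0, "72 h": 900.0}
actin_counts = {"control": 500.0, "6 h": 500.0, "48 h": 400.0, "72 h": 450.0}

normalised = normalise_to_reference(b18_counts, actin_counts)
print(fold_change_vs_control(normalised, "control"))
```

Normalising to the reference transcript corrects for differences in the amount of RNA loaded per lane; it assumes that the reference transcript itself is unaffected by the treatment.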

Example 4: Pneumotoxic reaction

Figure 3 represents a Northern blot analysis
of rat lung mRNA after intraperitoneal injection of
pneumotoxicants (4-ipomeanol, 1-(3-furyl)-4-hydroxypentanone
(IPO), methylcyclopentadienyl manganese tricarbonyl (MMT)
or alpha-naphthylthiourea (ANTU)). These agents are known to
induce in the lung acute lesions of Clara (IPO) and
alveolar cells (MMT) as well as to increase the permeability
of the alveolar/blood barrier (ANTU). A Northern blot
containing 15 µg of total RNA in each lane was hybridised
with a 225 bp-long rat B18 probe, stripped and reprobed
with a 572 bp-long β-actin probe, both labelled with 32P.
The Northern blot was quantified using Phosphorimaging
Technology and rat B18 mRNA data were normalised to the
β-actin mRNA level.

Example 5: Proteinic activity of the B18 polypeptide

An analysis of the complete human B18
amino acid sequence shows that said polypeptide presents
specific portions showing a homology with other
anti-oxidant enzymes (starting from a Leucine at position 36
until a Cysteine at position 47) and another portion
having an important homology with beta chains of ATP
synthase (starting from a Glutamic acid at position 13
until a Glycine at position 38).

Furthermore, the B18 amino acid sequence
according to the invention shows an important homology with
an Aspergillus fumigatus allergen (34% identity and 60%
homology by using Clustal V sequence alignment), especially
upon the portion of said B18 polypeptide having possible
antioxidant properties. Therefore, it is possible that a
peroxisomal protein (possibly homologous to B18 protein) is
able to induce and to bind IgE from patients sensitised to
Aspergillus fumigatus peroxisomal proteins after an
induction of the patient immune system with Aspergillus
fumigatus allergen. This mechanism can be compared to a
reaction obtained with the manganese superoxide dismutase
(MnSOD) wherein the human MnSOD is able to bind to IgE from
patients sensitised to Aspergillus fumigatus MnSOD.

Furthermore, the Inventors have identified a
portion of the B18 human polypeptide which presents a
homology with a Cyclophilin-binding domain of Candida
boidinii PMP20 (receptor of the immunosuppressant drug
cyclosporine A). Said possible Cyclophilin-binding domain
starts from the Threonine at position 150 and extends to
the Leucine at position 161.

Example 6: B18 human gene and mRNA alternative splicing

As represented in the enclosed Figure 4, the
Inventors have identified upon the genomic DNA (SEQ ID NO
10) 5 exons and 5 introns. By RT-PCR (using primers
5'-gggtatgggactagctggcg-3' and 5'-ctggccaacattccaattgcag-3')
and according to the genomic sequence, 4 different cDNAs
corresponding to the transcription of the said genomic DNA
have been identified in human lung and in human brain. A
first cDNA of 736 bp corresponds to the cDNA encoding the
complete amino acid sequence of the B18 protein according
to the invention. However, 3 other cDNAs of 601, 604 and
469 bp were also identified, and comprise specific
splicings of one or more exons.
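The product sizes quoted above can be related to each other with simple arithmetic. The sketch below assumes the primer positions given for SEQ ID NO 15 (33-52) and SEQ ID NO 16 (747-768) in the Summary and the variant sizes assigned there to SEQ ID NO 7 to 9; the helper names are hypothetical. It is arithmetic only and does not establish which exons are absent from which variant.

```python
FULL_LENGTH_CDNA_BP = 736
VARIANT_CDNA_BP = {"SEQ ID NO 8": 601, "SEQ ID NO 9": 604, "SEQ ID NO 7": 469}

# Primer positions on SEQ ID NO 1 as listed in the Summary of the invention.
FORWARD_PRIMER_SPAN = (33, 52)
REVERSE_PRIMER_SPAN = (747, 768)


def amplicon_length(forward_span: tuple, reverse_span: tuple) -> int:
    """Expected product size from the outermost primer positions (inclusive)."""
    return reverse_span[1] - forward_span[0] + 1


def missing_bp(full_length: int, variant_length: int) -> int:
    """Number of base pairs absent from a variant relative to the full product."""
    return full_length - variant_length


print("Expected full-length RT-PCR product:",
      amplicon_length(FORWARD_PRIMER_SPAN, REVERSE_PRIMER_SPAN), "bp")
for name, size in VARIANT_CDNA_BP.items():
    print(f"{name}: {size} bp, {missing_bp(FULL_LENGTH_CDNA_BP, size)} bp shorter")
```

The expected amplicon between the two primers is 736 bp, matching the first cDNA. The three variants are 135, 132 and 267 bp shorter, and 135 + 132 = 267, which would be compatible with the shortest product lacking both of the segments that are individually missing from the other two; the text itself does not state this.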

Therefore, another aspect of the present
invention is related to said specific portions of the
complete genomic or cDNA nucleotide sequence according to
the invention or to specific portions of the complete amino
acid sequence of the B18 protein according to the
invention, which could also be used as specific markers of
the B18 activity, preferably the anti-oxidative activity.

Example 7: Knock-out mouse

Exons of a mouse genomic sequence encoding
the B18 polypeptide according to the invention have been
deleted by homologous recombination. Said homologous
recombination has been obtained with a genetic sequence
comprising a neomycin resistance gene. The targeting vector
with said gene and a thymidine kinase gene (in order to
eliminate non-homologous recombinants with ganciclovir) has
been prepared. Said recombination was used for the deletion
of one or more exons of the B18 polypeptide. After
electroporation of ES cells with the targeting vector,
positive clones having undergone homologous
recombination were identified by Southern blot with
labelled probes. Aggregation of said positive clones with a
morula from a Swiss pseudo-pregnant mouse produces several
chimeric mice which survive after birth. Several homozygote
mice are obtained by cross-breeding and are used as a model
for the above-mentioned diseases.

Similar experiments may be done with another
mammal whose B18 sequence is known (the B18 sequences of a
mouse and a rat and their alignment with the human sequence
are shown in the enclosed Figure 5).

Example 8: Chromosome localisation of human B18 gene

Radiation hybrid clones (GeneBridge 4
Radiation Hybrid Panel, Research Genetics) were used for
performing chromosome localisation by PCR with two
different pairs of primers (5'-caggttcaccttgttccctggctc-3'
(SEQ ID NO 14), 5'-atgttatgcaaccctttgcgacac-3' (SEQ ID NO
17) and 5'-gtgtttgaaggggagccagggaac-3' (SEQ ID NO 18),
5'-agagacagggtttcaccatcttgg-3' (SEQ ID NO 19)).

The Inventors have located the B18 genomic
sequence on human chromosome 11q13. The B18 gene has been
located 7.15-6.1 cR from marker D11S913 between markers
D11S1963 and D11S4407 (Genome Database internet site).

Unknown genes linked to different disorders
have been localised in the same region of chromosome 11.
Therefore, the B18 gene is possibly associated with these
disorders:

- atopy (atopic hypersensitivity: asthma, hay fever and
  eczema; MIM n°147050 at OMIM of NCBI internet site),
- high bone mass syndrome (MIM n°601884),
- osteopetrosis (MIM n°259700),
- osteoporosis-pseudoglioma syndrome (MIM n°259770) and
- Bardet-Biedl syndrome 1 (MIM n°209901).

CLAIMS

1. Amino acid sequence having more than 70% 
homology with the sequence SEQ ID NO 2 . 

2. Amino acid sequence according to claim 1, 
5 having more than 85% homology with the sequence SEQ ID NO 

2. 

3. Amino acid sequence according to claim 1 
or 2, having more than 95% homology with the sequence SEQ 
ID NO 2. 

10 4. Amino acid sequence according to any one 

of the preceding claims, corresponding to SEQ ID NO 2 or an 
immunoreactive portion thereof. 

5. Nucleotide sequence encoding the amino 
acid sequence according to any one of the preceding claims 

15 and presenting more than 70% homology with SEQ ID NO 1 or 
its complementary strand. 

6. Nucleotide sequence according to claim 5, 
having more than 85% homology with the sequence SEQ ID NO 1 
or its complementary strand. 

20 7. Nucleotide sequence according to claim 5 

more than 95% homology wit^h the sequence SEQ ID NO 1 or its 
complementary strand. 

8. Nucleotide sequence according to any one 
of the claims 5 to 7, corresponding to the sequence SEQ ID 

25 NO 1, its complementary strand or a portion thereof 
specific for SEQ ID NO 1 and comprising more than 15 base 
pairs. 

9. Vector comprising the nucleotide sequence 
according to any one of the -claims 5 to 8 . 

30 10 • Inhibitor directed against the amino acid 

or nucleotide sequence according to any one of the claims 1 
to 8. 
11. Inhibitor according to claim 10, being an
antibody, preferably a monoclonal antibody, or a portion of
said antibody.

12. Diagnostic device comprising an element
selected from the group consisting of the amino acid
sequence according to any one of the claims 1 to 4, the
nucleotide sequence according to any one of the claims 5 to
8, the inhibitor according to claim 10 or 11, their
portions or a mixture thereof.

13. Method for the in vitro detection of lung
injuries and diseases or oxidative stress-related diseases
and disorders, especially inflammatory diseases, comprising
the steps of:

- isolating a sample from a body fluid of a patient,
preferably a human patient,
- possibly inhibiting the contaminants present in said
sample,
- putting said sample in contact with an element selected from
the group consisting of the amino acid sequence
according to any one of the claims 1 to 4, the
nucleotide sequence according to any one of the claims 5
to 8, the inhibitor according to claim 10 or 11, their
portions or a mixture thereof, and
- detecting a reaction of a molecule present in said
sample with said element.

14. Pharmaceutical composition comprising a
pharmaceutically acceptable carrier and an element selected
from the group consisting of the amino acid sequence
according to any one of the claims 1 to 4, the nucleotide
sequence according to any one of the claims 5 to 8, the
inhibitor according to claim 10 or 11, their portions or a
mixture thereof.

15. Use of the pharmaceutical composition
according to claim 14 for the manufacture of a medicament
for the prevention and/or the treatment of lung injuries or
diseases, and of oxidative stress-related diseases or
disorders, such as specific cardio-vascular diseases like
arteriosclerosis, neurodegenerative disorders such as
Alzheimer's disease, Parkinson's disease, amyotrophic
lateral sclerosis, apoptosis and inflammatory reactions,
allergic reactions such as asthma, hay fever and eczema,
high bone mass syndrome, osteopetrosis, osteoporosis-
pseudoglioma syndrome, and Bardet-Biedl syndrome 1.

16. Cell transformed by the vector according
to claim 9 or comprising a partial or total deletion of its
nucleotide sequence according to any one of the claims 5 to
8.

17. Non-human animal, preferably a mammal,
transformed by the vector according to claim 9 or
comprising a partial or total deletion of its nucleotide
sequence according to any one of the claims 5 to 8.
CLUSTAL V alignment of human and rat B18 amino acid sequences (Identity:
90%, Homology: 97.5%):

FIG. 5a

B18hum    MAPIKVGDAIPAVEVFEGEPGNKVNLAELFKGKKGVLFGVPGAFTPGCSK
B18rat    MAPIKVGDTIPSVEVFEGEPGKKVNLAELFKDKKGVLFGVPGAFTPGCSK

B18hum    THLPGFVEQAEALKAKGVQVVACLSVNDAFVTGEWGRAHKAEGKVRLLAD
B18rat    THLPGFVEQAGALKAKGAQVVACLSVNDVFVTAEWGRAHQAEGKVQLLAD

B18hum    PTGAFGKETDLLLDDSLVSIFGNRRLKRFSMVVQDGIVKALNVEPDGTGL
B18rat    PTGAFGKETDLLLDDSLVSLFGNRRLKRFSMVIDKGVVKALNVEPDGTGL

B18hum    TCSLAPNIISQL
B18rat    TCSLAPNILSQL

CLUSTAL V alignment of human and mouse B18 amino acid sequences (Identity:
91%, Homology: 96%):

B18hum    MAPIKVGDAIPAVEVFEGEPGNKVNLAELFKGKKGVLFGVPGAFTPGCSK
B18mouse  MAPIKVGDAIPSVEVFEGEPGKKVNLAELFKGKKGVLFGVPGAFTPGCSK

B18hum    THLPGFVEQAEALKAKGVQVVACLSVNDAFVTGEWGRAHKAEGKVRLLAD
B18mouse  THLPGFVEQAGALKAKGAQVVACLSVNDVFVIEEWGRAHQAEGKVRLLAD

B18hum    PTGAFGKETDLLLDDSLVSIFGNRRLKRFSMVVQDGIVKALNVEPDGTGL
B18mouse  PTGAFGKATDLLLDDSLVSLFGNRRLKRFSMVIDNGIVKALNVEPDGTGL

B18hum    TCSLAPNIISQL
B18mouse  TCSLAPNILSQL
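
The identity figure quoted for such an alignment can be reproduced with a few lines of code. The sketch below is a minimal illustration, not the CLUSTAL V procedure itself: it assumes an ungapped alignment of equal-length sequences and counts identical positions only. The "homology" percentage, which also credits conservative substitutions, is not computed here. The function name `percent_identity` is a hypothetical helper; the two sequences are the human and mouse B18 sequences of SEQ ID NO 2 and SEQ ID NO 6 in one-letter code.

```python
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of identical positions in two equal-length aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have the same aligned length")
    matches = sum(a == b for a, b in zip(seq_a, seq_b))
    return 100.0 * matches / len(seq_a)


# Human B18 (SEQ ID NO 2) and mouse B18 (SEQ ID NO 6), one-letter code.
HUMAN_B18 = (
    "MAPIKVGDAIPAVEVFEGEPGNKVNLAELFKGKKGVLFGVPGAFTPGCSK"
    "THLPGFVEQAEALKAKGVQVVACLSVNDAFVTGEWGRAHKAEGKVRLLAD"
    "PTGAFGKETDLLLDDSLVSIFGNRRLKRFSMVVQDGIVKALNVEPDGTGL"
    "TCSLAPNIISQL"
)
MOUSE_B18 = (
    "MAPIKVGDAIPSVEVFEGEPGKKVNLAELFKGKKGVLFGVPGAFTPGCSK"
    "THLPGFVEQAGALKAKGAQVVACLSVNDVFVIEEWGRAHQAEGKVRLLAD"
    "PTGAFGKATDLLLDDSLVSLFGNRRLKRFSMVIDNGIVKALNVEPDGTGL"
    "TCSLAPNILSQL"
)

print(f"human vs mouse identity: {percent_identity(HUMAN_B18, MOUSE_B18):.1f}%")
```

For gapped alignments, such as the cDNA alignments, the denominator convention (alignment length or length of one sequence) would have to be chosen explicitly.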

CLUSTAL V alignment of human and rat cDNA sequences (Identity: 612/780,
78.5%):

FIG. 5b

B18hum  GCCAGGAGGCGGAGTGGAAGTGGCCGTGGGGCGGGTATGGGACTAGCTGG
B18rat  - TG CGTC CTAGGCAG

B18hum  CGTGTGCGCCCTGAGACGCTCAGCGGGCTATATACTCGTCGGTGGGGCCG
B18rat  CATA GCC GGA TCGGTGCTCCGTGCATCGGCTACTTGGAC --

B18hum  GCGGTCAGTCTGCGGCAGCGGCAGCAAGACGGTGCAGTGAAGGAGAGTGG
B18rat  GTGCGTGGCAGGCAGAGCAGGCCGG AAAGGAG CAG GTT G G

B18hum  GCGTCTGGCGGGGTCCGCAGTTTCAGCAGAGCCGCTGCAGCCATGGCCCC
B18rat  GAGTGTGGTGGGGCCCGCAGCTTCAGCAGTGCCGCGGTGACTATGGCCCC
        * ** *** **** ****** ******** ***** * * ********

B18hum  AATCAAGGTGGGAGATGCCATCCCAGCAGTGGAGGTGTTTGAAGGGGAGC
B18rat  GATCAAGGTGGGAGACACCATTCCCTCAGTGGAGGTATTTGAAGGGGAAC
        ************** **** ** ******* * * * *********** *

B18hum  CAGGGAACAAGGTGAACCTGGCAGAGCTGTTCAAGGGCAAGAAGGGTGTG
B18rat  CTGGAAAGAAGGTGAACTTGGCAGAGCTGTTCAAGGACAAGAAAGGTGTT
        * ** ** ********* ****************** ****** *****

B18hum  CTGTTTGGAGTTCCTGGGGCCTTCACCCCTGGATGTTCCAAGACACACCT
B18rat  TTGTTTGGAGTCCCTGGGGCATTTACACCTGGCTGTTCCAAGACCCATCT
        ********** ******** ** ** ***** *********** ** **

B18hum  GCCAGGGTTTGTGGAGCAGGCTGAGGCTCTGAAGGCCAAGGGAGTCCAGG
B18rat  GCCTGGGTTTGTGGAGCAAGCCGGAGCTCTGAAGGCCAAGGGAGCACAAG
        *** ************** ** * ************ ** ** ** * ** *

B18hum  TGGTGGCCTGTCTGAGTGTTAATGATGCCTTTGTGACTGGCGAGTGGGGC
B18rat  TGGTGGCCTGTCTGAGTGTTAATGATGTCTTCGTGACTGCAGAGTGGGGT
        *************************** *** ******* ********

B18hum  CGAGCCCACAAGGCGGAAGGCAAGGTTCGGCTCCTGGCTGATCCCACTGG
B18rat  CGAGCCCACCAGGCAGAAGGCAAGGTTCAGCTCCTGGCTGACCCCACTGG
        ********* **** ************* ************ ********

B18hum  GGCCTTTGGGAAGGAGACAGACTTATTACTAGATGATTCGCTGGTGTCCA
B18rat  AGCTTTTGGAAAGGAGACAGATTTACTACTAGATGATTCTTTGGTGTCTC
        ** ***** *********** *** ************* *******

B18hum  TCTTTGGGAATCGACGTCTCAAGAGGTTCTCCATGGTGGTACAGGATGGC
B18rat  TCTTTGGGAATCGTCGGCTAAAAAGGTTCTCCATGGTGATAGACAAGGGC
        ************* ** ** ** *************** ** * * ***

B18hum  ATAGTGAAGGCCCTGAATGTGGAACCAGATGGCACAGGCCTCACCTGCAG
B18rat  GTAGTAAAGGCACTGAACGTGGAGCCGGATGGCACAGGCCTCACCTGCAG
        **** ***** ***** ***** ** ***********************

B18hum  CCTGGCACCCAATATCATCTCACAGCTCTGAGGCCCTGGGCCAGATTACT
B18rat  CCTGGCCCCCAACATCCTCTCACAACTCTGAGGCCCTGA-CCAGA--ATG
        ****** ***** *** ******* ************* ***** *

B18hum  TCCTCCACCCCTCCCTATCTCACCTGCCCAGCCCTGTGCTGG-GGCCCTG
B18rat  TCCTCTGACTCTCCC-ATCTCCTCCACCCAGCTCTGGGCCAAAGGCCCAG
        ***** * ***** ***** * ****** *** ** ***** *

B18hum  CA ATTGGAATG TTGGCCAGATTTCTGC
B18rat  TACCTCCTTACCTGAGGGCCACTGGAATGGAACCTTGACAATATTTCTGC
        * * ******* *** * * ********

B18hum  AATAAACACTTGTGGTTTGCGGAAAAAAA
B18rat  AATAAACAGTT-TAATTTGTGAAAAAAAAAAAAAAAAAAAAA
        ******** ** * * * * * * *******
CLUSTAL V alignment of human and mouse cDNA sequences (Identity: 552/675,
81.8%):

FIG. 5c

B18hum    GCCAGGAGGCGGAGTGGAAGTGGCCGTGGGGCGGGTATGGGACTAGCTGG
B18mouse

B18hum    CGTGTGCGCCCTGAGACGCTCAGCGGGCTATATACTCGTCGGTGGGGCCG
B18mouse  TGCTCCGTG CATCGACGTGCTTG

B18hum    GCGGTCAGTCTGCGGCAGCGGCAGCAAGACGGTGCAGTGAAGGAGAGTGG
B18mouse  GCAGGCAG AGCAGGCCGG AAAGAAGCAGGTTGG

B18hum    GCGTCTGGCGGGGTCCGCAGTTTCAGCAGAGCCGCTGCAGCCATGGCCCC
B18mouse  GAGTGTGGCGGAGCCCGCAGCTTCAGCAGCTCCGCGGTGACCATGGCCCC

B18hum    AATCAAGGTGGGAGATGCCATCCCAGCAGTGGAGGTGTTTGAAGGGGAGC
B18mouse  GATCAAGGTGGGAGATGCCATTCCCTCAGTGGAGGTATTTGAAGGGGAAC

B18hum    CAGGGAACAAGGTGAACCTGGCAGAGCTGTTCAAGGGCAAGAAGGGTGTG
B18mouse  CGGGAAAGAAGGTGAACTTGGCAGAGCTGTTCAAGGGCAAGAAAGGTGTT

B18hum    CTGTTTGGAGTTCCTGGGGCCTTCACCCCTGGATGTTCCAAGACACACCT
B18mouse  TTGTTTGGAGTCCCTGGGGCATTTACACCTGGCTGTTCTAAGACCCACCT

B18hum    GCCAGGGTTTGTGGAGCAGGCTGAGGCTCTGAAGGCCAAGGGAGTCCAGG
B18mouse  GCCTGGGTTTGTGGAGCAAGCTGGAGCTCTGAAGGCTAAGGGAGCGCAGG

B18hum    TGGTGGCCTGTCTGAGTGTTAATGATGCCTTTGTGACTGGCGAGTGGGGC
B18mouse  TGGTGGCCTGTCTGAGCGTTAATGACGTCTTTGTGATTGAAGAGTGGGGT

B18hum    CGAGCCCACAAGGCGGAAGGCAAGGTTCGGCTCCTGGCTGATCCCACTGG
B18mouse  CGAGCCCACCAGGCAGAAGGCAAGGTTCGGCTCCTGGCTGACCCCACTGG

B18hum    GGCCTTTGGGAAGGAGACAGACTTATTACTAGATGATTCGCTGGTGTCCA
B18mouse  AGCCTTTGGGAAGGCGACAGACTTATTATTGGATGATTCTTTGGTGTCTC

B18hum    TCTTTGGGAATCGACGTCTCAAGAGGTTCTCCATGGTGGTACAGGATGGC
B18mouse  TCTTTGGGAATCGTCGGCTGAAAAGGTTCTCCATGGTGATAGACAACGGC

B18hum    ATAGTGAAGGCCCTGAATGTGGAACCAGATGGCACAGGCCTCACCTGCAG
B18mouse  ATAGTGAAGGCACTGAACGTGGAGCCAGATGGCACAGGCCTCACCTGCAG

B18hum    CCTGGCACCCAATATCATCTCACAGCTCTGAGGCCCTGGGCCAGATTACT
B18mouse  CCTGGCCCCCAACATCCTCTCCCAACTCTGAGGCCCTGG-CCAGATG

B18hum    TCCTCCACCCCTCCCTATCTCACCTGCCCAGCCCTGTGCTGGGGCCCTGC
B18mouse  TCCTCTGACTCTCCC-ATCTCTCCCACCCGGCTCT AGGCC

B18hum    AATTGGAATGTTGGCCAGATTTCTGCAATAAACACTTGTGGTTTGCGGAA
B18mouse  AAAAGGCTCGGTACCTCCTTACTGGGAGC- CACGT
(1) GENERAL INFORMATION:

(i) APPLICANT: 

(A) NAME: UNIVERSITE CATHOLIQUE DE LOUVAIN 

Halles Universitaires 

(B) STREET: Place de l'Universite, 1

(C) CITY: LOUVAIN-LA-NEUVE

(E) COUNTRY: BELGIUM 

(F) POSTAL CODE (ZIP) : B-1348 

(A) NAME: UNIVERSITE DE MONS-HAINAUT 

(B) STREET: Place du Parc 20

(C) CITY: MONS 

(E) COUNTRY: BELGIUM 

(F) POSTAL CODE (ZIP): B-7000 

(ii) TITLE OF INVENTION: PEROXISOME-ASSOCIATED PEPTIDE, NUCLEOTIDE
SEQUENCE ENCODING SAID PEPTIDE AND THEIR USES IN THE
DIAGNOSTIC AND/OR THE TREATMENT OF LUNG INJURIES AND
DISEASES, AND OF OXIDATIVE STRESS-RELATED DISORDERS

(iii) NUMBER OF SEQUENCES: 19 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.30 (EPO)

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 193..681

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 
GCCAGGAGGC GGAGTGGAAG TGGCCGTGGG GCGGGTATGG GACTAGCTGG CGTGTGCGCC  60

CTGAGACGCT CAGCGGGCTA TATACTCGTC GGTGGGGCCG GCGGTCAGTC TGCGGCAGCG  120

GCAGCAAGAC GGTGCAGTGA AGGAGAGTGG GCGTCTGGCG GGGTCCGCAG TTTCAGCAGA  180

GCCGCTGCAG CC ATG GCC CCA ATC AAG GTG GGA GAT GCC ATC CCA GCA  228
              Met Ala Pro Ile Lys Val Gly Asp Ala Ile Pro Ala
                1               5                  10

GTG GAG GTG TTT GAA GGG GAG CCA GGG AAC AAG GTG AAC CTG GCA GAG  276
Val Glu Val Phe Glu Gly Glu Pro Gly Asn Lys Val Asn Leu Ala Glu
         15                  20                  25

CTG TTC AAG GGC AAG AAG GGT GTG CTG TTT GGA GTT CCT GGG GCC TTC  324
Leu Phe Lys Gly Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe
     30                  35                  40

ACC CCT GGA TGT TCC AAG ACA CAC CTG CCA GGG TTT GTG GAG CAG GCT  372
Thr Pro Gly Cys Ser Lys Thr His Leu Pro Gly Phe Val Glu Gln Ala
 45                  50                  55                  60

GAG GCT CTG AAG GCC AAG GGA GTC CAG GTG GTG GCC TGT CTG AGT GTT  420
Glu Ala Leu Lys Ala Lys Gly Val Gln Val Val Ala Cys Leu Ser Val
                 65                  70                  75

AAT GAT GCC TTT GTG ACT GGC GAG TGG GGC CGA GCC CAC AAG GCG GAA  468
Asn Asp Ala Phe Val Thr Gly Glu Trp Gly Arg Ala His Lys Ala Glu
             80                  85                  90

GGC AAG GTT CGG CTC CTG GCT GAT CCC ACT GGG GCC TTT GGG AAG GAG  516
Gly Lys Val Arg Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu
         95                 100                 105

ACA GAC TTA TTA CTA GAT GAT TCG CTG GTG TCC ATC TTT GGG AAT CGA  564
Thr Asp Leu Leu Leu Asp Asp Ser Leu Val Ser Ile Phe Gly Asn Arg
    110                 115                 120

CGT CTC AAG AGG TTC TCC ATG GTG GTA CAG GAT GGC ATA GTG AAG GCC  612
Arg Leu Lys Arg Phe Ser Met Val Val Gln Asp Gly Ile Val Lys Ala
125                 130                 135                 140

CTG AAT GTG GAA CCA GAT GGC ACA GGC CTC ACC TGC AGC CTG GCA CCC  660
Leu Asn Val Glu Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro
                145                 150                 155

AAT ATC ATC TCA CAG CTC TGA GGCCCTGGGC CAGATTACTT CCTCCACCCC  711
Asn Ile Ile Ser Gln Leu  *
            160

TCCCTATCTC ACCTGCCCAG CCCTGTGCTG GGGCCCTGCA ATTGGAATGT TGGCCAGATT  771

TCTGCAATAA ACACTTGTGG TTTGCGGAAA AAAA  805

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 163 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:
Met Ala Pro Ile Lys Val Gly Asp Ala Ile Pro Ala Val Glu Val Phe
  1               5                  10                  15

Glu Gly Glu Pro Gly Asn Lys Val Asn Leu Ala Glu Leu Phe Lys Gly
             20                  25                  30

Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys
         35                  40                  45

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gln Ala Glu Ala Leu Lys
     50                  55                  60

Ala Lys Gly Val Gln Val Val Ala Cys Leu Ser Val Asn Asp Ala Phe
 65                  70                  75                  80

Val Thr Gly Glu Trp Gly Arg Ala His Lys Ala Glu Gly Lys Val Arg
                 85                  90                  95

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu Thr Asp Leu Leu
            100                 105                 110

Leu Asp Asp Ser Leu Val Ser Ile Phe Gly Asn Arg Arg Leu Lys Arg
        115                 120                 125

Phe Ser Met Val Val Gln Asp Gly Ile Val Lys Ala Leu Asn Val Glu
    130                 135                 140

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn Ile Ile Ser
145                 150                 155                 160

Gln Leu  *
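
The relationship between the coding region of SEQ ID NO 1 and the protein of SEQ ID NO 2 can be checked by translating the CDS codon by codon. The sketch below is a minimal illustration using the standard genetic code; the names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first twelve codons (nucleotides 193 to 228) and the last six codons plus the stop codon (nucleotides 661 to 681) of the listing are used as input.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    residues = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":  # stop codon reached
            break
        residues.append(residue)
    return "".join(residues)


# Nucleotides 193-228 and 661-681 of SEQ ID NO 1.
CDS_START = "ATGGCCCCAATCAAGGTGGGAGATGCCATCCCAGCA"
CDS_END = "AATATCATCTCACAGCTCTGA"

print(translate(CDS_START))
print(translate(CDS_END))
```

The one-letter output corresponds to the three-letter residues at positions 1 to 12 and 157 to 162 of SEQ ID NO 2.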

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus Rattus 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 136..624

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TGCGTCCTAG GCAGCATAGC CGGATCGGTG CTCCGTGCAT CGGCTACTTG GACGTGCGTG  60

GCAGGCAGAG CAGGCCGGAA AGGAGCAGGT TGGGAGTGTG GTGGGGCCCG CAGCTTCAGC  120

AGTGCCGCGG TGACTATGGC CCCGATCAAG GTGGGAGACA CCATTCCCTC AGTGGAGGTA  180

TTTGRAGGGG AACCTGGAAA GAAGGTGAAC TTGGCAGAGC TGTTCAAGGA CAAGAAAGGT  240

GTTTTGTTTG GAGTCCCTGG GGCATTTACA CCTGGCTGTT CCAAGACCCA TCTGCCTGGG  300

TTTGTGGAGC AAGCCGGAGC TCYGAAGGCC AAGGGAGCAC AAGTGGTGGC CTGTCTGAGT  360

GTTAATGATG YCTTCGTGAC TGCAGAGTGG GGTCGAGCCC ACCAGGCAGA AGGCAAGGTT  420

CAGCTCCTGG CTGACCCCAC TGGAGCTTTT GGAAAGGAGA CAGATTTACT ACTAGATGAT  480

TCTTTGGTGT CTCTCTTTGG GAATCGTCGG CTAAAAAGGT TCTCCATGGT GATAGACAAG  540

GGCGTAGTAA AGGCACTGAA CGTGGAGCCG GATGGCACAG GCCTCACCTG CAGCCTGGCC  600

CCCAACATCC TCTCACAACT CTGAGGCCCT GACCAGAATG TCCTCTGACT CTCCCATCTC  660

CTCCACCCAG CTCTGGGCCA AAGGCCCAGT ACCTCCTTAC CTGAGGGCCA CTGGAATGGA  720

ACCTTGACAA TATTTCTGCA ATAAACAGTT TAATTTGTGA AAAAAAAAAA AAAAAAAAAA  780



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus Rattus 

(ix) FEATURE: 

(A) NAME/KEY: Modified-site

(B) LOCATION: 17

(D) OTHER INFORMATION: /product= "Glu or Gly"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 63

(D) OTHER INFORMATION: /product= "Leu or Pro"

(ix) FEATURE:

(A) NAME/KEY: Modified-site

(B) LOCATION: 79

(D) OTHER INFORMATION: /product= "Ala or Val"

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Ala Pro Ile Lys Val Gly Asp Thr Ile Pro Ser Val Glu Val Phe
  1               5                  10                  15

Xaa Gly Glu Pro Gly Lys Lys Val Asn Leu Ala Glu Leu Phe Lys Asp
             20                  25                  30

Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys
         35                  40                  45

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gln Ala Gly Ala Xaa Lys
     50                  55                  60

Ala Lys Gly Ala Gln Val Val Ala Cys Leu Ser Val Asn Asp Xaa Phe
 65                  70                  75                  80

Val Thr Ala Glu Trp Gly Arg Ala His Gln Ala Glu Gly Lys Val Gln
                 85                  90                  95

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu Thr Asp Leu Leu
            100                 105                 110

Leu Asp Asp Ser Leu Val Ser Leu Phe Gly Asn Arg Arg Leu Lys Arg
        115                 120                 125

Phe Ser Met Val Ile Asp Lys Gly Val Val Lys Ala Leu Asn Val Glu
    130                 135                 140

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn Ile Leu Ser
145                 150                 155                 160

Gln Leu

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 675 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 99..588

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

TGCTCCGTGC ATCGACGTGC TTGGCAGGCA GAGCAGGCCG GAAAGAAGCA GGTTGGGAGT  60

GTGGCGGAGC CCGCAGCTTC AGCAGCTCCG CGGTGACCAT GGCCCCGATC AAGGTGGGAG  120

ATGCCATTCC CTCAGTGGAG GTATTTGAAG GGGAACCGGG AAAGAAGGTG AACTTGGCAG  180

AGCTGTTCAA GGGCAAGAAA GGTGTTTTGT TTGGAGTCCC TGGGGCATTT ACACCTGGCT  240

GTTCTAAGAC CCACCTGCCT GGGTTTGTGG AGCAAGCTGG AGCTCTGAAG GCTAAGGGAG  300

CGCAGGTGGT GGCCTGTCTG AGCGTTAATG ACGTCTTTGT GATTGAAGAG TGGGGTCGAG  360

CCCACCAGGC AGAAGGCAAG GTTCGGCTCC TGGCTGACCC CACTGGAGCC TTTGGGAAGG  420

CGACAGACTT ATTATTGGAT GATTCTTTGG TGTCTCTCTT TGGGAATCGT CGGCTGAAAA  480

GGTTCTCCAT GGTGATAGAC AACGGCATAG TGAAGGCACT GAACGTGGAG CCAGATGGCA  540

CAGGCCTCAC CTGCAGCCTG GCCCCCAACA TCCTCTCCCA ACTCTGAGGC CCTGGCCAGA  600

TGTCCTCTGA CTCTCCCATC TCTCCCACCC GGCTCTAGGC CAAAAGGCTC GGTACCTCCT  660

TACTGGGAGC CACGT  675
(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mouse 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Pro Ile Lys Val Gly Asp Ala Ile Pro Ser Val Glu Val Phe
  1               5                  10                  15

Glu Gly Glu Pro Gly Lys Lys Val Asn Leu Ala Glu Leu Phe Lys Gly
             20                  25                  30

Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys
         35                  40                  45

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gln Ala Gly Ala Leu Lys
     50                  55                  60

Ala Lys Gly Ala Gln Val Val Ala Cys Leu Ser Val Asn Asp Val Phe
 65                  70                  75                  80

Val Ile Glu Glu Trp Gly Arg Ala His Gln Ala Glu Gly Lys Val Arg
                 85                  90                  95

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Ala Thr Asp Leu Leu
            100                 105                 110

Leu Asp Asp Ser Leu Val Ser Leu Phe Gly Asn Arg Arg Leu Lys Arg
        115                 120                 125

Phe Ser Met Val Ile Asp Asn Gly Ile Val Lys Ala Leu Asn Val Glu
    130                 135                 140

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn Ile Leu Ser
145                 150                 155                 160

Gln Leu



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 469 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 161..382



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG  60

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC  120

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGGTTCG  180

GCTCCTGGCT GATCCCACTG GGGCCTTTGG GAAGGAGACA GACTTATTAC TAGATGATTC  240

GCTGGTGTCC ATCTTTGGGA ATCGACGTCT CAAGAGGTTC TCCATGGTGG TACAGGATGG  300

CATAGTGAAG GCCCTGAATG TGGAACCAGA TGGCACAGGC CTCACCTGCA GCCTGGCACC  360

CAATATCATC TCACAGCTCT GAGGCCCTGG GCCAGATTAC TTCCTCCACC CCTCCCTATC  420

TCACCTGCCC AGCCCTGTGC TGGGGCCCTG CAATTGGAAT GTTGGCCAG  469

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 601 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 161..514



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG  60

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC  120

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGACACA  180

CCTGCCAGGG TTTGTGGAGC AGGCTGAGGC TCTGAAGGCC AAGGGAGTCC AGGTGGTGGC  240

CTGTCTGAGT GTTAATGATG CCTTTGTGAC TGGCGAGTGG GGCCGAGCCC ACAAGGCGGA  300

AGGCAAGGTT CGGCTCCTGG CTGATCCCAC TGGGGCCTTT GGGAAGGAGA CAGACTTATT  360

ACTAGATGAT TCGCTGGTGT CCATCTTTGG GAATCGACGT CTCAAGAGGT TCTCCATGGT  420

GGTACAGGAT GGCATAGTGA AGGCCCTGAA TGTGGAACCA GATGGCACAG GCCTCACCTG  480

CAGCCTGGCA CCCAATATCA TCTCACAGCT CTGAGGCCCT GGGCCAGATT ACTTCCTCCA  540

CCCCTCCCTA TCTCACCTGC CCAGCCCTGT GCTGGGGCCC TGCAATTGGA ATGTTGGCCA  600

G  601
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 604 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 161..517



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG  60

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC  120

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGGTGGG  180

AGATGCCATC CCAGCAGTGG AGGTGTTTGA AGGGGAGCCA GGGAACAAGG TGAACCTGGC  240

AGAGCTGTTC AAGGGCAAGA AGGGTGTGCT GTTTGGAGTT CCTGGGGCCT TCACCCCTGG  300

ATGTTCCAAG GTTCGGCTCC TGGCTGATCC CACTGGGGCC TTTGGGAAGG AGACAGACTT  360

ATTACTAGAT GATTCGCTGG TGTCCATCTT TGGGAATCGA CGTCTCAAGA GGTTCTCCAT  420

GGTGGTACAG GATGGCATAG TGAAGGCCCT GAATGTGGAA CCAGATGGCA CAGGCCTCAC  480

CTGCAGCCTG GCACCCAATA TCATCTCACA GCTCTGAGGC CCTGGGCCAG ATTACTTCCT  540

CCACCCCTCC CTATCTCACC TGCCCAGCCC TGTGCTGGGG CCCTGCAATT GGAATGTTGG  600

CCAG  604

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2710 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: single
(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(iii) HYPOTHETICAL: NO

(iv) ANTI-SENSE: NO

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 2516..2710

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 2074..2135

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 1932..1970

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 1728..1859

(ix) FEATURE:

(A) NAME/KEY: exon

(B) LOCATION: 802..936

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

TCTGTCCCTT AGCGCCCCCG CGGGGGCTTA CCCCATCCCA CTCCATGACC TCCCCTCCCC  60

CCATGGCGAA TTCCCACCTT TCTGTCTTTC ACTCACTTCC TGGAACCGTC CCCAGGGCCT  120

TGGACCTTCC CCCTTCTCCT CCCAAACCTT GTGAGACCCC ATTCCCTTTC TACTTCATCC  180

TGCTCTCAAC TTTTGGGCTC CTCAGAGGCC CTCACCCCTG ACTCTCTCTC CCTACCCACT  240

CTGGTCCCAT GAAGCCCTCA AGTACTCTGG GGATGGATCC TTCCCCCTTC AAAAGATTCC  300

TTCTTTTGTT CTACACCTCC TGGGTGTAGG GGCCTGGACA CCCTCCCCCA ACGTTCCACC  360

TGCCGCTGCC CTTCCTCTTC CTCCTCCTGA GGGTGGGACC CTCAGACCTG GCCAAGATCC  420

TCTCCCTCCA TGTTGTCAGG GACTCCTCCT CACCCCCAAA TACAGCCCTC TAGCCCCTGT  480

CCATTTTATT CCACTCCTTT CCTGTAACCT AGACAGCATG TTATGCAACC CTTTGCGACA  540

CATGGGGAAA CCTTCCCTCC CTTCCTCTGT TGTCACCAAT GGCCCCTTAA GAGGAGCAGG  600

GCCACCTTGA AACTTGGAGG ATATGGGGTA ACCCAGTGGfe AGCGGGCAGG GAGGGCCCTT  660

GGAAACTGAC AGGGCTGGAG TATCCTGCTG GGTTTCAGCC CCGGTTCCTG CAGGCACAGC  720

TGCCAGGCTC TCTGTTCACC TTCCTGCCTC TGGTTTGCCC CGGCTCCCTC ACCCCCCTTA  780

CCCTGGAGTC CTTCCTTCTA GGTGGGAGAT GCCATCCCAG CAGTGGAGGT GTTTGAAGGG  840

GAGCCAGGGA ACAAGGTGAA CCTGGCAGAG CTGTTCAAGG GCAAGAAGGG TGTGCTGTTT  900

GGAGTTCCTG GGGCCTTCAC CCCTGGATGT TCCAAGGTGA GGCCCTTCCC CTTCTGAAGA  960

TCAGGACCTG GGGATCTTTT GTGTTGCTCT TAAGTCCTCC ACATAGTCCT GATAGGACTC  1020

CTAAAAAGCA TTTCAGTGCC ATCACAAAAC AAGTAGAGCT GGGTAGAGCT GGGCGCGGTG  1080

GCTCACGCCT GTAATCCCAG CACTTTGGGA GGCCAAGGCG GGTGGATCAC GAGGTCAGGA  1140

GTCCAAAACC AGCCTGGCCA AGATGGTGAA ACCCTGTCTC TACTAAAAAT GCAAAAAAAT  1200

CAGCCGGATA TGGTGGCGGG CGCCTGTAAT CCCAGGTATT GGGGAGGCTG AGGCAGAGAA  1260

TTGCTTGAAC CCAGGAGGCG TAGGTTGCAG TGAGTGGAGA TCGTGCCTCT GCAGTCCAGC  1320

CTGGGTGAAA GAGCGAGACT CCGTCTCAAA ATGAAAAAAA AAAAAGAAAA CAAGTAGAGA  1380

CTGCAAAAAG GGAACAGTAC CGGGAATGTT GGAGAAAAAC ATACTACAAT TAAATCCAAC  1440

ACCCCTGTTG GTCCTGCTAA ATGACAGGCA CTGTGGAAGG TGCTTGGGAC TCAGATAAAT  1500

AAGACAAAGA TCTGCCCATG GAAAGTTCAC GTCTGGACCA TAAGGCATTA GGTTTCATTC  1560

TGAGCTTCCT AGTGGCCAAG GCAAAAAGGA AATAGAATGG TTTAGACAGC TCTCATTGTC  1620

TGATCAAAGG TGTTGAGGCA GAGCACTGAG GAGGGCCTGG AGATAAAGGG TGGGCTGGGG  1680

GTCAGATGCA GTTATCCCTT TGCCGACCCT TTGTTCCCCT TCCTCAGACA CACCTGCCAG  1740

GGTTTGTGGA GCAGGCTGAG GCTCTGAAGG CCAAGGGAGT CCAGGTGGTG GCCTGTCTGA  1800

GTGTTAATGA TGCCTTTGTG ACTGGCGAGT GGGGCCGAGC CCACAAGGCG GAAGGCAAGG  1860

TGAGGTGAGG GGCCTGCAGG GAGTCAGGAC CAGGTAGGAT ATTCTTCTTG TGACCTCTAC  1920

TTTCTCTGCA GGTTCGGCTC CTGGCTGATC CCACTGGGGC CTTTGGGAAG GTGAGTGTTC  1980

CCCTGACCGC CACAGGGACA TGGCGGTGCG GGGAGCAGTG GGGGCCCTTG GCCTCTTCAA  2040

GGATTTCTGA CACTTTTCTC TGTCTCTTCT TAGGAGACAG ACTTATTACT AGATGATTCG  2100

CTGGTGTCCA TCTTTGGGAA TCGACGTCTC AAGAGGTAAA AGTGGAGAGT CCTCTGTGGA  2160

GAAAGTCCTC TGTGGGAGAG AGTCCTCTGT GGGAGAGAGT CCTCTGTGGA GAGGGTCCTC  2220

TGTGGGAAGA GTCGTCTGTG GGGGAGATGT GTGGGAGAGA GTCCTGTGTG GGGAGAGTCT  2280

TCTGTAGGGG AGAGTCCTCT GGGGAGAGAG TCCTGTGTGG GGGAGAGTCC TCTGTGGGGA  2340

GAGTCCTCTG TGTGGAGAGA GTCCTGTGTG GTGGTGAGTC CTCTGTGGGG GAGAGTCCTC  2400

TGTGGGGGGA GTCCTCTCTG GAGTTCTCTT GGGCCCCTGG CTGTTCACTG CCTGTCTCCA  2460

TGCCCAGCCT CCAAGCCCAG GCTGATGCAG CTGGCTGGGC CCCTCTTTCC GGCAGGTTCT  2520

CCATGGTGGT ACAGGATGGC ATAGTGAAGG CCCTGAATGT GGAACCAGAT GGCACAGGCC  2580

TCACCTGCAG CCTGGCACCC AATATCATCT CACAGCTCTG AGGCCCTGGG CCAGATTACT  2640

TCCTCCACCC CTCCCTATCT CACCTGCCCA GCCCTGTGCT GGGGCCCTGC AATTGGAATG  2700

TTGGCCAGAT  2710

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: single

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:
GCCATCCCAG CAGTGGAGGT GTTTG  25
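
The exon features listed for the genomic sequence of SEQ ID NO 10 lend themselves to a small worked example. The sketch below is illustrative only: it takes the five exon locations from the feature table (1-based, inclusive coordinates), computes their lengths, and shows how exon segments could be joined from a genomic string. The helper names `exon_lengths` and `splice` are hypothetical, and the toy sequence in the last line is invented for demonstration and is not B18 sequence.

```python
# Exon locations from the feature table of SEQ ID NO 10 (1-based, inclusive).
EXONS = [(802, 936), (1728, 1859), (1932, 1970), (2074, 2135), (2516, 2710)]


def exon_lengths(exons):
    """Length in nucleotides of each exon given (start, end) pairs."""
    return [end - start + 1 for start, end in exons]


def splice(genomic: str, exons) -> str:
    """Join the exon segments of a genomic sequence in 5' to 3' order."""
    return "".join(genomic[start - 1:end] for start, end in sorted(exons))


lengths = exon_lengths(EXONS)
print("exon lengths:", lengths)
print("total exonic length:", sum(lengths))

# Toy demonstration on an invented 12-nucleotide string with two "exons".
print(splice("AAACCCGGGTTT", [(1, 3), (10, 12)]))
```

Converting from the listing's 1-based inclusive coordinates to Python slices requires subtracting one from the start only, which is the usual source of off-by-one errors in this kind of calculation.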
(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:
TTGAACAGCT CTGCCAGGTT CACC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
TGGAGGTGTT TGAAGGGGAG CCAG 
(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CAGGTTCACC TTGTTCCCTG GCTC 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: DNA (genomic)



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGGTATGGGA CTAGCTGGCG 
(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) TOPOLOGY: linear
(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CTGGCCAACA TTCCAATTGC AG 

22 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
ATGTTATGCA ACCCTTTGCG ACAC 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GTGTTTGAAG GGGAGCCAGG GAAC 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
AGAGACAGGG TTTCACCATC TTGG 

24 
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Bl a hum MAP I EV6EAI P AVEYTE GE P GN KVNLftELFTZGKK GVX, FGVP GAFT PGCSX 

Bl StaOU* MAP I KVGHA2 5 S V^FEGEPGKKVBlAELFKGKXGVIjre VPGAFTPGCSK 

B 1 6mc«U*« raLFGETOOAGAXKAXCJA QVVftCLS VWDVFVI EEWGPAHQ^^GKTv^CLLW? 

Bl3 hUlSf, PTGAFGKETDLLLDDS LV5 1 FGKFJ&KRFSMVVQlJG IVKAUlVM DGT'GL 

Bl Smous © PTGAI 7 GKATDLLLPDS LVS LF<SKFJUiKP;F.S*-SV:I DWG IVK&INVEP DGTGL 



BlBhura 
BlfijTiou.se 



TCSLAPtfl 
rC3LA*WIL30L 



CLUSTAL V slitftmwmt pf human wi^ lit cENA aequ*rj«i! (identity: SL2/780 
IB, 5*}: 



El 9 hum 
Blfirat 



gc<^ggaggcgsagtggaagtggccct^ 

' TG O^PC— — - — CTAGQC AG 



BlBhum 

MS hum 
B13rat 



CGTGT-SCGCCCTGAGACGCTCAGCeGCCTATATACTCCTCGGTGGSGCCG. 
CATA GCC GGA TCG GTCCT-CCGTGCA^CGBCTACTTGWJC — 



•* * ^ 



GC "5 GTCAOT CTGCGSCAG C G-GCAG CAAGACGGTG CA GTGAA 6 GAGAGTG & 
-GTSCCTSGCAGGCAGAGCAGGCC^S AAACGA G CASGTT G G 

* A * * * * * * A * * «4 «- ♦ 



3KS0OCID: <WO 890905-:a2TI > 



WO 



PCIYBE98yO0124 



6/7 

PIG ♦ 5b 



BlEhum G OSTCTG GCG GGGTC C G CACTi'TCRGCRGA G CC G CT G CA G C CAT GGCCCC 

Bl tat. GAGTGT G g T'-G G G GC O CG CAG CT T CAG CAGTG C OG OS GTSACT'A T'G GC CCC 

*"* *:** **** AAA AAA A******* ^Wi, A * li A A A A A 



BIShum 
BIS rat 



AATGAA G G T'G GGAG AT GCGATCCCAG GTG G A G GTGTT T GA AG G< GGAti C 
GATCAA G G T'G GGAG A CACCATT OI7 C T CAG TG G A G GTATTTGA AGteG GAAC 

"* *********** AAAA A* »fV;HAUU 4******4: AA* A 



Bid rat: 



'oGGAAC Aft G CTGAA C CT ft <3 CA£AG CT G TT CAA ftfi GCAAGAAGGGT <7TG 
C T GGAAAGAA G GTG ATsCTTG £ CAGAGCT G TT C AAG<^C&AGAAAG G»T OT T 

* W A-*. ******.* W WW4W.**t.ntAU^ **.**** U4U 



BIS hum 



BlShum 
BlSrat 



CTCTTrGGAGTTCCTGGMCCTTCACCCCTG^TGTTCCAA^CArACCT 
TTGT T TG GAGTCCCTGG C?3C p L TT TACACCTG G CTGTf CC^AGAGC CATC T 

****-***AAA **^^**t* +:* *A Wi44. * * * * -A A A A A A * *.* V. * 

GCCAG GGTTTGT G GA G CAEGCTGAGCCJ 'CTGA/i G GCJ CAA GG-SAGT C Cft^ G 
GCCT G GGTTTGTI5G A GGAAGCCGGAGCTCT GAAGGCCAA GGC^AG CA CAAG 



A A A A**A-******AfcAA A A A 



BlSbum 
Bia.rat 



TGGTG&CCT GT CT GA G T GTT AATGAT G CCTT T G T <^ A C'l' -5 GC-3AG T G GG-isC 
TGGT GG CCTG T CT G A PTCTTAATGAT GT CTT ^ GTG A CT <5 GAGA G T GGG-GT 

■A-A*AAAAA*A.**.**.****a,*A., fc ,fc4,AA* * * * Ir A A * A A * * ***A-**A* 



Bl ttbum CG^CCCAC^GGCGGAA.GGCAAGGTl'CGSCT CCTGGCTGA'J'CCCACT'GG 

B1 5 ra t CGAGCCCACCAGGC AGAAGGCAAGGTTC AQCTCCTCGCTGACCCCACTGG 

+ * *. A A 4*Jr*4 4*^tttv* AAAAAAAA**** ** A A A A A A 

Bl Chum GOCCTTT<KGAAGGiRGACW%CTTAl^ ACTAGATGATTCGCTGGTGTCCA 

Bl B rat AGCTTTTG^AAAGGAGACAGATITI ACTACTAX*ATCTTTCrrtGGrrGTCTC 

A* **AA* **^t*H* *A AAA A A A * * * * * * * A A A AAA-**** 

BlBhum TCTT T'GG GAATC GA CG TCT CAP, C5AGG3T T CTCCA TGGT-SfiT A CA^GATGGC 

Bl B r at TCTT TSG GAATC GTC G GC TAAAAAGGT TCTCCA T GG TCATA GACAAGG GC 

A AAA AAA A***** * * A A AA A * * * * f. * * * A A A A A A* * * W1 

BIBhum ATAGTGA 2 * G G CCCT GA^ GFGGAACCAGA TGG CACAGftCC'J 'CACC T G CAG 

Bl B rat GTAGTAAA G GCACTGA^CGT'GG AG CCGGATGG CACA^CCTGACC T GCAG 

^*** A A A A A ***** Wf4 tt HU AAA 

BlBhuET. C CT £*G CAC C Cj=lAT it T C AT C T CAO AG^CT CT i^AG G CC C 1 V G G 'LyC CAG/'. T TA 1 J T 

Bl 6 1: a t CCTGGCCCCCAAC ATCCT^rc^CTA-rTCTGAGGCCCl^ -CC/^GA — Al'G 

^*-^ 4,4 -* W*** *r** A A AAA A A ****** * A A A A A A ***** A 

B 1 fihUETi TCCT^CTkCCCCTCC CTATCTC^CCTCC:C4^GOCCTGrGCT^E-'G«CCCTG 

BIBrat TCCTCT^CTCTCCC-^TCT'CCTCCACCCAG^^ 

AAAAA A A**** fc * * 4 * ^ ****** AA ***** A 

B L 6 hum CA A'l-JGGAATG-' TTGGCCAGATTTCTGC- 

B 1 0 tti. fc T ACCTC CTTACCTGAG G G C'CACTGG.?«ATOGAAC C2 'l'GA CA«"^j^TT T CTfG C 

*****AAA *4A***A4AA*A*. 

B 1 6 hum A?^T AAACACTTGTGGTTTGCGG.Z' AAAAAA - — . 

016 rat AATAAACA GT T - TAf TT GTCl^A. E LAAV-J , J l uV , iAAA^ i AAfiJ , t 

AAAAAAAA A * * * * * * A AAA**** 






CLUSTAL V alignment of human and mouse cDNA sequences (Identity: 55V67S



GC CAGGAGGCGG AGTGGAAGTGGCCGTGG GGCGGGTATGGGACTAGCTGG 



Bl Bhum 

El 



CGTGTGCGCCCTGAGRGGCTCAGC GGGCTATATACTCGTCG &TQGGGC CG 
„ jgc VCCGTO -CATCGACGTGcrTG 



GCGG^C^GTCTGCGGQAGCGGCA^CT^GACGGTGC^CTGAfeGGA^AGTGG 
GCAGGCA&- — AGCAG&CCGG AAAGAAGCAG GTTGG 

"***•♦** 4-4** **....» v 



BlShuro 



GCGTCTGGCGGGGTCCGCAGTTTCAGCAGAGCCGCTGCAGCCAT^GqcCC 
GAGTGTGGCG^GCCCeCAGCTTCAGCAGCTCCG^^ 



Bl fclmm 



^TC\AGGTGGG?.GATGCmTCCCAG^GTGGAGGTCJTT , r , GAAGGGGAGC 
GATCWGGTGGGAwATGC-CATri'CCCTCAGTGC^eGrATTtGAAGGGGftAC 



ElBtvom 

BiBlEWUS-* 



CAG&GAACAAGGTGAAC'CTOGCAGACCrGTTCTA-SOGCAAG^JSGGTGT'G 
CGGWA^^<5GTGAACTTGGCA^GGTGTTqAA«GGCA\&AAAGGTGTT 



E-lShum 
Bl Mouse 



CT GTTTGGAGTTCCT GGGGCCTTC ACCCCl'GGATGTTCCAAG.teACAC CT 
rrCTTTG<WCTCCCTGGGGCATTTAM^ 



Bl&mouE* 



gCCAGGGTTTGTGGAGCAGG CTG AGGCTCTGAAGGC CAAGGGAGTCCAGS 
GC CTGG GTTTG ? GGAG CAAG CT GGAGCTCT G Aft G GCTAAG GGA G CG C AG C 

*************** ******** W** 



El 3 hum 



TGGTGGCCTGTCTGACTGTWATCL^TGCCT7TGTGACT<^CGP t GT^GGGC 
T L ^GTGGCCTG1TCTGAGCGT^/'ATGACQTCT7TGT^TTG.^GAGTGGGGT 



BlBhiari 



CGAGCCCAC^AGGCGG-^JSG-ZAABGTTDGiSCT-CCTGGCT^TCCCACTGG 
CGAGCCcAca^GGCAGAAGGCAAGCTTCGGCTCCTGECT'SACCCC^.C^GG 



BIG ham 



G GC CT T T G G G AACrGAG^CAG AC TT ATT AC T A GAT GAT TCGCTGGTGTC CA 
AGCCTTTGGGftAGGCGACAGACTTATTATrGSATSATTCTWGGTGTCTC 



Big bum 

Bl Bttous* 



O-COTTGGGAATCGACCTCTCPAGAGGTTC^CCATGGTGGTAnftGGATGGC 

T ET T TGG GAATC GT CG G CT GAAAAGGTT CTC CATG G T GATAGAC AA CGG C 
***.*»**4*.#** «,* -***.**.*.****..**-♦** * 



Biahuni 



ATAGtGWJSGCCCTGAATGTGGA-ACCAGftTGGCACAGGCCTCACCT^CAG 
ATA G T G AAG G CACT G fAC £T £ GAG C CAG AT" G G C A CAG C CCT CPXLCT & CAG 



BlSnwu^e 



CCTGG CACCCA^sTATCA TCTCAOJG CTCT G AGG C C CTGG GCCA GAT TACT 
CCTGGCCCCCAAC ATC CT irTCCCA^CTCTSAGGCCCTGG-CMGATG- - - 



£ 1 Shorn T DETCCACCCCTCC CTATCT'CAC CTGCCCAGCrCTGTjGCTGGGSCCCTGC 
Bl 9 mouse TCwTCTGACTCTCCC-ATCTCTC CCACCCGGCTCT Av-GCC 

* 4 * « * ■* 44*44 * 4 * A- * « 4 4-4 #4 A A 

Bl Phian AATTGG^TGTT6GC»GATT^CTGC^ATJ»AACACTTGTGGTTTG^GGA^ 

B 1 BmOU *e AAAAGGCTCGST ACCTCCTTACTGGGA&C- GACGT 

<•****• * * > <• * « * * 




SEQUENCE LISTING

(1) GENERAL INFORMATION:

    (i) APPLICANT:
        (A) NAME: UNIVERSITE CATHOLIQUE DE LOUVAIN
                  Halles Universitaires
        (B) STREET: Place de l'Université, 1
        (C) CITY: LOUVAIN-LA-NEUVE
        (E) COUNTRY: BELGIUM
        (F) POSTAL CODE (ZIP):

        (A) NAME: UNIVERSITE DE MONS-HAINAUT
        (B) STREET: Place du Parc 20
        (C) CITY: MONS
        (E) COUNTRY: BELGIUM
        (F) POSTAL CODE (ZIP): B-7000

    (ii) TITLE OF INVENTION: PEROXISOME-ASSOCIATED PEPTIDE, NUCLEOTIDE
         SEQUENCE ENCODING SAID PEPTIDE, AND THEIR USES IN
         DIAGNOSTIC AND/OR THE TREATMENT OF LUNG INJURIES AND
         DISEASES AND OF OXIDATIVE STRESS-RELATED DISORDERS

    (iii) NUMBER OF SEQUENCES: 19

    (iv) COMPUTER READABLE FORM:
        (A) MEDIUM TYPE: Floppy disk
        (B) COMPUTER: IBM PC compatible
        (C) OPERATING SYSTEM: PC-DOS/MS-DOS
        (D) SOFTWARE: PatentIn Release #1.0, Version (EPO)



(2) INFORMATION FOR SEQ ID NO: 1:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 805 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: cDNA

    (iii) HYPOTHETICAL: NO

    (iv) ANTI-SENSE: NO

    (vi) ORIGINAL SOURCE:
        (A) ORGANISM: Homo sapiens

    (ix) FEATURE:
        (A) NAME/KEY: CDS

<&} LOCATION: .691 



    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:
GCCAGGAGGC GGAGTGGAAG TGGCCGTGGC GCGGGTATGG GACTAGCTGG CGTGTGCGCC 60
CTGAGACGCT CAGCGGGCTA TATACTCGTC GGTGGGGCCG GCGGTCAGTC TGCGGCAGCG 120




GCAGCAAGAC GGTGCAGTGA AGGAGAGTGG GCGTCTGGCG GGGTCCGCAG TTTCAGCAGA 180

SCCGCTGCAG CC ATG GCC CCA ATC AAG GTG GGA GAT GCC ATC CCA GCA 22 a 

Met Ala ere He Lye val Gly A^sp Ala lie Fro Ala 
1 5 IS 

GTG GAG GTG TTT GAA GGC GAG CCA G<?S AAC A=iG GTG A*C CTG GCA GAG 276 
Val Glu Val Phe Glu Gly Glu. Bro Gly Asn Lys Val Leu Ala Glu 

15 20 25 

CTG TTC AAG GGC AAG AAG GOT GTG CTG TTT GGA GTT CCT GGG SCC TTC ^ij 
Leu Fhe Lys Gly Lya Lye Gly v*l Leu Phe Gly Val Pro Cly phe 
30 3& 40 

ACC CCT GGK TGT TCC A*iG ACA CAC CTG CCA GGG TTT GTG GAG CA5 G r T 37 ^ 

Thr Pro Gly Cys Ser Lys Thx Kia Lew fro Gly £>he val Glu Gin Ala 
45 5D 5& 5D 

GAG OCT CTG A*,G GCC AAG GGA GTC C*G GTG GTG GCC TGT CTG AGT GTT 
Glu Ala Lea Ly* Ala Lya Gly Val Gin v<Q Val Ala Cys Leu S»r Val 
65 70 J 75 



AAT GAT GCC TTT GT*G ACT GG-C GAG TGG GGC CGA GCC CAC A^G GCG 
Afln -M* pb « val Thr Gly Glu -ftp Gly Arg Ala hia t,y 5 Ala Glu 

£0 £5 go 



CTG AAT GTG G7^A CCA GAT GGC ACA GGC CTC ACC TGC AGC CTG GCA CCC 
Leu A 3n Val Glu Pro Aap Gly Thr Gly Leu Thr Cys Sci Leu Ala Pro 
145 ISO 155 



469 



GGC AAG GTT CGG CTC CTG GCT GAT CCC ACT CGG GCC TTT GGG AAG GAG 51 G 

Gly Val Arg Leu Leu Ala Asp Pro Thr Gly Ala Fb* Gly Lys Glu 

100 105 



ACA GAC TTA TTA CTA GAT GAT T'CG CTG GT'G TCC ATC TTT GGG AAT CGA 564 
Thr Asp Leu I,eu l* u Aap Asp Ssr Leu V*l $er He P hc Gly Asr- Azu 
liO US 120 



CCT CTC AAG AGS TTC TCC ATG GT£ GTA CAG GAT GGC ATA GTG AAG GCC 612 
Arg Leu Lys Arg Pbo £er Met V«l Val Gin Asp Gly riit Val L-ye Al* 
125 130 135 i^o 



AAT ATC ATC TCA CAG CTC TGA GGCCCTGGG C CAGATTACTT OCTCCACCCC 7ll 
Asn lie lie £<-r Gin Leu * 
160 

TCCCTATCTC ACCTGCCCMG CCCTGTGCT'G GGGCCCTGCA ATTGSAATGT TGGCCAGATT 771 
TCTGCAATAA ACACTTGTGG TTTGCGGA^lA AAAft g05 

(2) INFORMATION FOR SEQ ID NO: 2:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 162 amino acids
        (B) TYPE: amino acid
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: protein

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:




Met Ala Pro Ile Lys Val Gly Asp Ala Ile Pro Ala Val Glu Val Phe
1               5                   10                  15

Glu Gly Glu Pro Gly Asn Lys Val Asn Leu Ala Glu Leu Phe Lys Gly
            20                  25                  30

Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys
        35                  40                  45

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gln Ala Glu Ala Leu Lys
    50                  55                  60

Ala Lys Gly Val Gln Val Val Ala Cys Leu Ser Val Asn Asp Ala Phe
65                  70                  75                  80

Val Thr Gly Glu Trp Gly Arg Ala His Lys Ala Glu Gly Lys Val Arg
                85                  90                  95

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu Thr Asp Leu Leu
            100                 105                 110

Leu Asp Asp Ser Leu Val Ser Ile Phe Gly Asn Arg Arg Leu Lys Arg
        115                 120                 125

Phe Ser Met Val Val Gln Asp Gly Ile Val Lys Ala Leu Asn Val Glu
    130                 135                 140

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn Ile Ile Ser
145                 150                 155                 160

Gln Leu



(2) INFORMATION FOR SEQ ID NO: 3:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 780 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (iii) HYPOTHETICAL: NO

    (iv) ANTI-SENSE: NO

    (vi) ORIGINAL SOURCE:
        (A) ORGANISM: Rattus rattus

    (ix) FEATURE:
        (A) NAME/KEY: CDS
        (B) LOCATION: 136..624

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:
TSCCTCCTAG GCAGCATA5C CG6ATCGGTG CTCCGTGCAT CGGCTACTTC GACGTGCGTG 6o 
GCAGGCAGAG CAGGCCGGAA AGGAGCAGGT TGGGAGTG7G GT0C&GCCCG CAGCTTCAGC 120 






AGTGCGGCOG. 


TGACTAT-GGC 


CCCGATCfcAG 


GTGGGAGACA 


CCATTCCCT'C 


AGTGGAGSTA 


1 s 0 


TTTGPAGGGG 


AACCTGGAA\ 


SAA£GTGAAC 


TTGGCAGAGC 


tgttcaagga 


CAAGAAAOGT 


240 


GTTTTGTTTG 


GAGTCCCTGG 


GGCATTTACA cctggctgtt 


CCAAGACCCA 


TCTGCCTSGG 


300 


TTTGTGSAGC 


AftGCCGGAGC 


TCY£AAGGCC 


aagggagcac 


AAGTggtggc 


CTGTCTGAGT 


360 


GTTAATCATG 


YCTTCGTGAC 


CTG ca gagt g g 




ACCAGGCAGA 


AGGCAAGGTT 


420 


cAgcTcctgg 


CTGACCCC?,C 


TGGAGCTTTT 


GGAftAGGAGA 




ACTASATGAT 


ISO 


TCTTTGGTGT 


C-TCTCTTTGG 


GAATCGTCGG 


CTAAAAAGGT 


TCTCCATGG? 


GA7ASACAAC 


540 


GGCGTAOTAA AGGCACTCAA 


CGTGGAGDCG 


GATGGCACAG 


GCCTCACCTG 


CACCCTGGCC 


600 


CC^AACATCC 


TCTCACAftCT 


CTGAGGCCCT 


GACCWJAATG 


TCCTCTGACT* 


CTCCCATCTC 


btio 


CTCCACCCAG 


CTCTGGGCCA 


AAGGCCCAGT 


ACCTCCTTAC 


CTGAGGfeCCA 


CTGGAATGGA 


720 


AGCTTGACAA 




AOAAACAGTT 


TAA*rTTGTC$i AAAAAftAAAA 


AAAAftAAAAA 





(2) INFORMATION FOR SEQ ID NO: 4:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 162 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (iii) HYPOTHETICAL: NO

    (iv) ANTI-SENSE: NO

    (vi) ORIGINAL SOURCE:
        (A) ORGANISM: Rattus rattus

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 17
        (D) OTHER INFORMATION: /product= "Glu or Gly"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 63
        (D) OTHER INFORMATION: /product= "Leu or Pro"

    (ix) FEATURE:
        (A) NAME/KEY: Modified-site
        (B) LOCATION: 79
        (D) OTHER INFORMATION: /product= "Ala or Val"

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

Met Ala Pro Ile Lys Val Gly Asp Thr Ile Pro Ser Val Glu Val Phe
1               5                   10                  15




Xaa Gly Glu Pro Gly Lys Lys Val Asn Leu Ala Glu Leu Phe Lys Asp
            20                  25                  30

Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys
        35                  40                  45

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gln Ala Gly Ala Xaa Lys
    50                  55                  60

Ala Lys Gly Ala Gln Val Val Ala Cys Leu Ser Val Asn Asp Xaa Phe
65                  70                  75                  80

Val Thr Ala Glu Trp Gly Arg Ala His Gln Ala Glu Gly Lys Val Gln
                85                  90                  95

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Glu Thr Asp Leu Leu
            100                 105                 110

Leu Asp Asp Ser Leu Val Ser Leu Phe Gly Asn Arg Arg Leu Lys Arg
        115                 120                 125

Phe Ser Met Val Ile Asp Lys Gly Val Val Lys Ala Leu Asn Val Glu
    130                 135                 140

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn Ile Leu Ser
145                 150                 155                 160

Gln Leu

(2) INFORMATION FOR SEQ ID NO: 5:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 675 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (iii) HYPOTHETICAL: NO

    (iv) ANTI-SENSE: NO

    (vi) ORIGINAL SOURCE:
        (A) ORGANISM: Mouse

    (ix) FEATURE:
        (A) NAME/KEY: CDS
(E) LOCATION ; S3. .5&B 

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

TGCTCCGTGC ATCGACGTGC TTGGCAGGCA GAGCAGGCCG GAAAGAAGCA GGTTGGGAGT 60

GTGGCGGAGC CCGCAGCTTC AGCAGCTCCG CGGTGACCAT GGCCCCGATC AAGGTGGGAG 120

ATGCCATTCC CTCAGTGGAG GTATTTGAAG GGGAACCGGG AAAGAAGGTG AACTTGGCAG 180

AGCTGTTCAA GGGCAAGAAA GGTGTTTTGT TTGGAGTCCC TGGGGCATTT ACACCTGGCT 240






^TCTAAGAC CCACCTGCCT GGGTTTGtGG AGCAAQCTGG 


AGCTCTGAAG 


GCTAAQGGAG 


300 


CGCAGGTG<?T GGCCTCTCT-G AGCGTTAATG ACGTCTTTGT 


gatt&Aagag tggggtcgag 




CCCACCAGGC AGAAGGCAAG GTTCGGCTCC TGGCTGACCC 


cactggagcc 


TTTGGGAAGG 


420 


CGAGAGACTT ATTATTGGAT GATTCTT7GG TSTCTCTCTT 


tgsgaatcgt 


CGGCTGAAAA 


4 DO 


GGTTCTCCAT GGTSATAGAC AACSGCATAG T^AAGGCACT' 


k\ftAGG TioGA G 


C-CAGA'? G GCA 




CAGGCCTCAC CTGCAGCCl'G GCCCCCAACA TCCTCTCCCA 


ACTCTGAGGC 


CCTGGCCAfcA 


eoo 


i^iL-L,Jti L1L1 UWYJX T U T CC- CACC C GGCTCTAGGC 


CAAAAGGCTC 


GGTACCTCCT 


660 


TACTGGGAGC Cj\CGT 














(2) INFORMATION FOR SEQ ID NO: 6:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 162 amino acids
        (B) TYPE: amino acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: peptide

    (iii) HYPOTHETICAL: NO

    (iv) ANTI-SENSE: NO

    (vi) ORIGINAL SOURCE:
        (A) ORGANISM: Mouse

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

Met Ala Pro Ile Lys Val Gly Asp Ala Ile Pro Ser Val Glu Val Phe
1               5                   10                  15

Glu Gly Glu Pro Gly Lys Lys Val Asn Leu Ala Glu Leu Phe Lys Gly
            20                  25                  30

Lys Lys Gly Val Leu Phe Gly Val Pro Gly Ala Phe Thr Pro Gly Cys
        35                  40                  45

Ser Lys Thr His Leu Pro Gly Phe Val Glu Gln Ala Gly Ala Leu Lys
    50                  55                  60

Ala Lys Gly Ala Gln Val Val Ala Cys Leu Ser Val Asn Asp Val Phe
65                  70                  75                  80

Val Ile Glu Glu Trp Gly Arg Ala His Gln Ala Glu Gly Lys Val Arg
                85                  90                  95

Leu Leu Ala Asp Pro Thr Gly Ala Phe Gly Lys Ala Thr Asp Leu Leu
            100                 105                 110

Leu Asp Asp Ser Leu Val Ser Leu Phe Gly Asn Arg Arg Leu Lys Arg
        115                 120                 125






Phe Ser Met Val Ile Asp Asn Gly Ile Val Lys Ala Leu Asn Val Glu
    130                 135                 140

Pro Asp Gly Thr Gly Leu Thr Cys Ser Leu Ala Pro Asn Ile Leu Ser
145                 150                 155                 160

Gln Leu



(2) INFORMATION FOR SEQ ID NO: 7:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 469 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: cDNA

    (iii) HYPOTHETICAL: NO

    (iv) ANTI-SENSE: NO

    (vi) ORIGINAL SOURCE:
        (A) ORGANISM: Homo sapiens

    (ix) FEATURE:
        (A) NAME/KEY: CDS
        (B) LOCATION: 161..392

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:



GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG 60

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC 120

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGGTTCG 180

GCTCCTGGCT GATCCCACTG GGGCCTTTGG GAAGGAGACA GACTTATTAC TAGATGATTC 240

GCTGGTGTCC ATCTTTGGGA ATCGACGTCT CAAGAGGTTC TCCATGGTGG TACAGGATGG 300

CATAGTGAAG GCCCTGAATG TGGAACCAGA TGGCACAGGC CTCACCTGCA GCCTGGCACC 360

CAATATCATC TCACAGCTCT GAGGCCCTGG GCCAGATTAC TTCCTCCACC CCTCCCTATC 420

TCACCTGCCC AGCCCTGTGC TGGGGCCCTG CAATTGGAAT GTTGGCCAG 469



(2) INFORMATION FOR SEQ ID NO: 8:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 601 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: cDNA

    (iii) HYPOTHETICAL: NO




    (iv) ANTI-SENSE: NO

    (vi) ORIGINAL SOURCE:
        (A) ORGANISM: Homo sapiens

    (ix) FEATURE:
        (A) NAME/KEY: CDS
        (B) LOCATION: 161..514

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG 60

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC 120

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGACACA 180

CCTGCCAGGG TTTGTGGAGC AGGCTGAGGC TCTGAAGGCC AAGGGAGTCC AGGTGGTGGC 240

CTGTCTGAGT GTTAATGATG CCTTTGTGAC TGGCGAGTGG GGCCGAGCCC ACAAGGCGGA 300

AGGCAAGGTT CGGCTCCTGG CTGATCCCAC TGGGGCCTTT GGGAAGGAGA CAGACTTATT 360

ACTAGATGAT TCGCTGGTGT CCATCTTTGG GAATCGACGT CTCAAGAGGT TCTCCATGGT 420

GGTACAGGAT GGCATAGTGA AGGCCCTGAA TGTGGAACCA GATGGCACAG GCCTCACCTG 480

CAGCCTGGCA CCCAATATCA TCTCACAGCT CTGAGGCCCT GGGCCAGATT ACTTCCTCCA 540

CCCCTCCCTA TCTCACCTGC CCAGCCCTGT GCTGGGGCCC TGCAATTGGA ATGTTGGCCA 600

G 601



(2) INFORMATION FOR SEQ ID NO: 9:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 604 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (iii) HYPOTHETICAL: NO

    (iv) ANTI-SENSE: NO

    (vi) ORIGINAL SOURCE:
        (A) ORGANISM: Homo sapiens

    (ix) FEATURE:
        (A) NAME/KEY: CDS
        (B) LOCATION: 161..517

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
GGGTATGGGA CTAGCTGGCG TGTGCGCCCT GAGACGCTCA GCGGGCTATA TACTCGTCGG 60

TGGGGCCGGC GGTCAGTCTG CGGCAGCGGC AGCAAGACGG TGCAGTGAAG GAGAGTGGGC 120

GTCTGGCGGG GTCCGCAGTT TCAGCAGAGC CGCTGCAGCC ATGGCCCCAA TCAAGGTGGG 180

AGATGCCATC CCAGCAGTGG AGGTGTTTGA AGGGGAGCCA GGGAACAAGG TGAACCTGGC 240

AGAGCTGTTC AAGGGCAAGA AGGGTGTGCT GTTTGGAGTT CCTGGGGCCT TCACCCCTGG 300

ATGTTCCAAG GTTCGGCTCC TGGCTGATCC CACTGGGGCC TTTGGGAAGG AGACAGACTT 360

ATTACTAGAT GATTCGCTGG TGTCCATCTT TGGGAATCGA CGTCTCAAGA GGTTCTCCAT 420

GGTGGTACAG GATGGCATAG TGAAGGCCCT GAATGTGGAA CCAGATGGCA CAGGCCTCAC 480

CTGCAGCCTG GCACCCAATA TCATCTCACA GCTCTGAGGC CCTGGGCCAG ATTACTTCCT 540

CCACCCCTCC CTATCTCACC TGCCCAGCCC TGTGCTGGGG CCCTGCAATT GGAATGTTGG 600

CCAG 604

(2) INFORMATION FOR SEQ ID NO: 10:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 2710 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (iii) HYPOTHETICAL: NO

    (iv) ANTI-SENSE: NO

    (vi) ORIGINAL SOURCE:
        (A) ORGANISM: Homo sapiens

tlx) FEATURE: 

{A? NAME/ KEY s fi>:on 

1B> IiOGATIOW=2516, -2TL0 

(iX) FEATURE; 

(A* NAME/ KEY: exon 

{B} LOCATION: 2074 . .2135 

(±X) .FEATURE: 

<A| WAME/KEV: exon 
{E.| LQCAT'It>H:1932. 

(ix) FUTURE j 

(A I MAME/KEYi ftXcn 

(B| LOCATION : 175a .. 1059 

(ix) FEATURE; 

(A] M7UMB/K&Y; ©xon 
(£) LOCATION: 8 ft£ 4 , 



    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:






TCTGTCCCl'T AGCGCCCCCG CGGGGGC*TA CCCCATCCCA CTCCATGACC TCCCCTCCCC &0 

CCATGSCGAA TTCCCACCTT TCTGTCTTTC ACTCACTTCC T&GAACCGTC CCCAGGGCCT 120 

TGGACCTTCC CCCTTCTCCT CCCAAACCTT GTGAGACCCC ATTCCCTTTC TACTTCATCC l&o 

TCCTCTCftAC TTTtfG-SGCrC CTCfcCAGGCC C^CACCCCTG ACTCTCTCTC CCTACCCACT 24 0 

CTGGTCCCAT GAAGCCCTCA AGTACTCTGG GGATGGATCC TTCCCCCTTC AAAAGA/TTCC 300 

ttcttttgtt ctacacctcc tggctgta&s ggcctggaca ccctccocca acgttccacd 

TGCCGCTGCC CTTCCTCTTC CTCCTCCTGA GGGtTGGGACC CTCAGACCFG &CCAAGATCC 420 

fcrcCCTCCA T-GTTGTCA.GG GACTCCXCCT CACCCCCAAA TACAGCCCTC TAGCCCCTGT 4 SO 

CCATTTTA.TT CCACTCCTTT CCTGTAACCT AGACAGCATG *TATGCAACC CTTTGCGA.CA 540 

CATGGGGAAA CCTTCCCTCC CWCCTCTGT TGTCACCAAT GGCCCCTTAA GAGGAGCAGG 600 

GCCACCTTGA AACTTGGAGG ATATGGGGTA ACCCAGTG6G AGCGGGCAGG GAGGGCCCtfT 56 0 

AG5GCTGGAG TATCCTGCTG GGTTTCAGCC CCGGTTCCTG CAGGCACAGC 720 

TGCCAGGCIC TcfGTTCACC TTCCTGCCTC TGGTTTGCCC CGGCTCITCTC ACCCCCCTT* 7 g 0 

COCTGGAGTC CTTCCTTCTA GGTGGGAGAT GCCATCCCAG CAGTGGAGGT ETTTGAAGGG 64V 

GA&CCAG&SA ACAAGGTGAA CCT&SCAGAG CTCTTCAAGG GCAAGAASGG TGTGCTGTTT S00 

GGAOTTCCTG GGGCCTTCAC CCCTG6ATGT TCGAAGGTGA GGCCCTTCCC CTTCTGAAGA &6Q 

TCAGGACCTG GGGATCTTTT GTGTTGCTCT TAAGTCCTCC ACATAGTCCT GATAGGACTC 1020 

CTAAAAAGGA TTTCAGTGCC ATCACAAAAC AAGTAGAGCT GGGTAGAGCT GGGCGCGGTG 1060 

GCTCACGCCT GTAATCCCAG CACTTTGGGA GGCCAAGGCG GGTGGATCAC GACGTCAGGA mo 

GTCCAAAACC AGCQTGGCCA AGAIGGTGAA ACCCTGTCTC TACTAAAAAT GCAAAAAAAT 12 00 

CAGCCGGATA TGGTGGCGGG CGCCTCTAAT CCCAGGTAT? GGGGAGGCTG AGGCAGAGAA L3«0 

TTGCTTGAAC CCAGGAGGCG TAGGTTGCAG TGAGrGGAGA TCGTGCCTCT GCAGT'CCAGC L32D 

CTGGGTGAAA GAGCGAGACT CCGTCTCAAA ATGAAAAAAA AAAAAGARAA CAAGTAGAGA 138 0 

CTGCAAAAAG GGAACAGTAC CGGSAATCTT GGAGAAAAaC AlACTACAAT TAAATCCAAC 1440 

ACCCCTGTTG GTCCTGCTAA ATGACAGGCA CTGTGGAAGG TGCTTGGGAC T DAGAT AAAT IS 00 

AAGACAAAGA TCTGCCCATG 6AAAGTTCAC GTCTGGACCA TAAGGCATT* GG7TTCA7TC 15&0 

TGAGCTTCCT AGTGGCCAAG GCAAAAAGGA AATAGAATGG TTTAGACAGC TCTCATTGlC 1620 

TGATCAAAGG TGTTGAGGCA GAGCACTGAG GftGGGCCTGG AGATAAAGGG TGGGCTGGGG 16(50 

GTCAGAT6CA GTTATCCCTT TGCCGACCCT TTGTTCCCCT TCCTCAGACA CACCTGCCAG 174 0 

GGTTTGTGGA GCAGGCTGAG GCTCTGAAGG CCAAGGGAGT CCAGSTGGTG GCCTGTCTGA LBOO 

GTGTTAACTGA TGCCTTTGTG ACTGGCGAGT GGGGCCGAGC CCACAAGGCG GAAGSCAAGG I860 




TGAGGTGAGG GGCCTGCASG GAGTCAGGAC CAGGTAGGftT ATTCTTCTTG TGACCTCTAC 19±Q 

TTTCTCTGCA GGTTCGGCT'C CTGGCTGATC CCACTGGGGC CTTTGSGftAG GTGAGTGTTC 1960 

CCCTGACCGC CACAGGGACA TGGCGGTKG G^&^GCAGTG GGGGCCCTTG GCCTCTTQAA 204 0 

GGATTTCTSA CACTTTTCTC TGTCTCTTCT TAGGAGACAG AC TT ATT ACT AC AT GA.TT C 9 2IO0 

CTGGTSTCCA TCTTTGGGAA TCGAGGTCTC AAGAGGTAAA AGTGGAGAGT CCTCTGTGGA 2160 

GAAAGTCCTC TOTGGGA6AG AGTCCTCTGT G-SGfljSAGAGT CCTCTG'TGGA GAG-SGTCCTC 222.0 

T GT GG-SftA GA GTCGTCTGTG GGGGAGATGT GTGGGAGAGA GTCCX-5TGTG GGGAGAGTCT 22 3 p 

TCTGTAGGGG AGAGTCCTCT GGGGASAGAG TCCTGTGTGS GGGAGAGTCC TCTGTGGGGA 234 0 

6A3TCCTCTG TSTGGA6AGA GFCCTGTGTG GFGGTGAGTC CTCTGTpGGG GAGAS TC ETC 24 00 

TGTGGGGGGA GTCCTCTCT'G aVoTTCTCTT GG-GCCCCTGG CTGTTCACTG CCT-STCTCCA 2-466 

J 

TGCC-CAGCCT CCAAGCCCAG GCTSATGCAS CTGGCTGGGC CCCTCTTTCC GGCAGGTTCT 2520 

CCATG-STGGT AGAGGATGGC ATAGTGAAGG CCCTGAATGT GGAACCASAT GGCACAGGCC 25 SO 

TCACCTGCAG CCTGGCACCC MTATOTCT CACAGCTCTG AGGCCCTGGG CCAGATTACT 2GdG 

TCCTCCACCC CTCCCTATCT CACCTGCCCA GCCCTGTGCT GGGGCCCTGC AATTGGAkTG 2700 

TTGGCCAGAT 2710 
(2) INFORMATION FOR SEQ ID NO: 11:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 25 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

GCCATCCCAG CAGTGGAGGT GTTTG 25

(2) INFORMATION FOR SEQ ID NO: 12:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 24 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

TTGAACAGCT CTGCCAGGTT CACC
(2) INFORMATION FOR SEQ ID NO: 13:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 24 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

TGGAGGTGTT TGAAGGGGAG CCAG

(2) INFORMATION FOR SEQ ID NO: 14:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 24 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

CAGGTTCACC TTGTTCCCTG GCTC

(2) INFORMATION FOR SEQ ID NO: 15:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 20 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

GGGTATGGGA CTAGCTGGCG
(2) INFORMATION FOR SEQ ID NO: 16:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 22 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

CTGGCCAACA TTCCAATTGC AG 22

(2) INFORMATION FOR SEQ ID NO: 17:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 21 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

OTGTTATGCA ACCCTTTGCG ACAC 2-3 

(2) INFORMATION FOR SEQ ID NO: 18:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 24 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

STGTTTQAAG GGGAGCCTJGB GAAG 

(2) INFORMATION FOR SEQ ID NO: 19:

    (i) SEQUENCE CHARACTERISTICS:
        (A) LENGTH: 24 base pairs
        (B) TYPE: nucleic acid
        (C) STRANDEDNESS: single
        (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: DNA (genomic)

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

AGAGACAGGG TTTCACCATC TTGG 24



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




PCT 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 6 
C12N 15/12, C07K 14/47 



A3 



(11) International Publication Number: WO 99/09054 

(43) International Publication Date: 25 February 1999 (25.02.99) 



(21) International Application Number: PCT/BE98/00124 

(22) International Filing Date: 20 August 1998 (20.08.98) 



(30) Priority Data:
9700692        20 August 1997 (20.08.97)        BE



(71) Applicants (for all designated States except US): UNIVERSITE
CATHOLIQUE DE LOUVAIN [BE/BE]; Halles Universitaires,
Place de l'Université 1, B-1348 Louvain-la-Neuve
(BE). UNIVERSITE DE MONS-HAINAUT [BE/BE];
Place du Parc 20, B-7000 Mons (BE).

(72) Inventors; and 

(75) Inventors/Applicants (for US only): KNOOPS, Bernard
[BE/BE]; Rue Chapelle Notre-Dame 3/1, B-1341 Ceroux-Mousty
(BE). HERMANS, Cedric [BE/BE]; Avenue
des Glycines 42, B-1030 Brussels (BE). BERNARD,
Alfred [BE/BE]; Avenue de la Chapelle 6, B-1200 Brussels
(BE). WATTIEZ, Ruddy [BE/BE]; Chemin du Sauvelon
17, B-7022 Hyon (BE). FALMAGNE, Paul [BE/BE]; Rue
du Point du Jour 8, B-7022 Mesvin (BE).

(74) Agents: VAN MALDEREN, Eric et al.; Office Van Malderen, 
Place Reine Fabiola 6/1, B-1083 Brussels (BE). 



(81) Designated States: CA, JP, US, European patent (AT, BE, CH,
CY, DE, DK, ES, FI, FR, GB, GR, IE, IT, LU, MC, NL,
PT, SE).



Published 

With international search report. 

(88) Date of publication of the international search report: 

15 April 1999(15.04.99) 



(54) Title: PEROXISOME-ASSOCIATED POLYPEPTIDE, NUCLEOTIDE SEQUENCE ENCODING SAID POLYPEPTIDE AND
THEIR USES IN THE DIAGNOSIS AND/OR THE TREATMENT OF LUNG INJURIES AND DISEASES, AND OF
OXIDATIVE STRESS-RELATED DISORDERS



(57) Abstract 

The present invention is related to an isolated and
purified polypeptide whose amino acid sequence presents
more than 70% homology with the sequence SEQ ID NO 1. The
present invention is also related to the nucleotide sequence
encoding said amino acid sequence, the inhibitor directed
against said sequences and their use in the diagnosis,
treatment and/or prevention of lung injuries or diseases
and oxidative stress-related disorders.



[Drawing: CLUSTAL V alignment of human and rat amino acid sequences (Identity: 90%, Homology: 97.5%); CLUSTAL V alignment of human and mouse amino acid sequences (Identity: 91%, Homology: 96%); CLUSTAL V alignment of human and rat cDNA sequences (Identity: 612/780, 78.5%)]
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